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SUMMARY 


This  report  describes  the  results  of  a  two-year  research  effort 
to  investigate  the  reconstruction  of  fine-resolution  images  of  space 
objects  using  earth-bound  optical  telescopes  despite  the  turbulence 
of  the  atmosphere.  The  results  of  this  research  are  an  indication 
that,  using  an  iterative  reconstruction  algorithm,  it  is  feasible  to 
reconstruct  diffraction-limited  images  from  the  Fourier  modulus  (or 
autocorrelation)  data  provided  by  stellar  speckle  interferometry. 

Methods  of  compensating  for  systematic  errors  and  noise  present 
in  astronomical  data  were  developed  and  were  applied  to  binary  star 
data,  from  which  a  diffraction-limited  image  was  successfully  recon¬ 
structed.  It  was  analytically  shown  that  the  following  types  of 
objects  are  uniquely  specified  by  their  Fourier  modulus:  objects 
consisting  of  separated  parts  satisfying  certain  disconnection  condi¬ 
tions,  objects  consisting  of  collections  of  distinct  points,  and 
objects  with  radial  symmetry.  Experimental  indications  are  that 
complicated  two-dimensional  objects  are  usually  unique.  An  equiva¬ 
lence  of  the  error-reduction  iterative  reconstruction  algorithm  and 
the  steepest-descent  gradient  search  method  was  shown,  and  the  con¬ 
vergence  of  this  algorithm  was  proven.  However,  the  input-output 
iterative  algorithm  and  other  gradient  search  algorithms  were  found 
to  converge  much  faster  in  practice.  A  survey  of  other  applications 
of  the  iterative  algorithm  was  performed,  and  it  was  found  to  be 
remarkably  useful  in  solving  problems  for  a  wide  range  of  applica¬ 
tions.  A  new  noniterative  method  was  developed  for  reconstructing 
the  support  of  an  object  from  the  support  of  its  autocorrelation; 
and  for  objects  consisting  of  a  collection  of  distinct  points,  a  new 
noniterative  method  was  developed  for  reconstructing  the  object  using 
a  product  of  three  translates  of  the  autocorrelation  function. 
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FINE  RESOLUTION  IMAGING  OF  SPACE  OBJECTS 


1 

INTRODUCTION  AND  OBJECTIVES 

This  report  describes  the  results  of  a  two-year  research  effort 
to  investigate  a  method  of  obtaining  high  resolution  images  of  space 
objects  using  earth-bound  optical  telescopes. 

Atmospheric  turbulence  severely  limits  the  resolution  of  large 
earth-bound  optical  telescopes.  Under  good  "seeing"  conditions,  the 
resolution  allowed  by  the  atmosphere  is  typically  one  second  of  arc, 
compared  with  0.02  seconds  of  arc,  the  theoretical  diffraction- 
limited  resolution  of  a  five-meter  diameter  telescope.  That  is,  the 
potential  exists  for  obtaining  images  having  fifty  times  finer  reso¬ 
lution  than  what  one  ordinarily  can  obtain. 

Several  interferometric  methods  are  capable  of  providing 
diffraction-limited  information  through  atmospheric  turbulence.  The 
most  promising  of  these  interferometric  methods  is  Labeyrie's  stellar 
speckle  interferometry.  The  high-resolution  information  provided  by 
these  methods  is  the  modulus  of  the  Fourier  transform  of  the  object; 
the  phase  of  the  Fourier  transform  is  lost.  Unfortunately,  except 
for  the  very  special  case  in  which  an  unresolved  star  is  very  near 
the  object  of  interest,  the  Fourier  modulus  can  be  used  to  directly 
compute  only  the  autocorrelation  of  the  object,  but  not  the  object 
itself.  The  autocorrelation  is  ordinarily  useful  only  for  determin¬ 
ing  the  diameter  of  the  object  or  the  separation  of  a  binary  star 
pair. 

We  have  developed  an  algorithm  for  reconstructing  the  object's 
spatial  (or  angular)  brightness  distribution  from  its  Fourier  modu¬ 
lus.  The  algorithm  relies  both  on  the  Fourier  modulus  data  measured 
by  stellar  speckle  interferometry  and  on  the  a  priori  constraint 
that  the  object  distribution  is  a  nonnegative  function.  Therefore, 
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the  combination  of  stellar  speckle  interferometry  with  the  iterative 
algorithm  can  provide  fine-resolution  diffraction-limited  images 
despite  the  presence  of  atmospheric  turbulence. 

The  major  goal  of  this  two-year  research  effort  were  threefold: 
(1)  to  improve  the  iterative  reconstruction  algorithm  to  make  it 
operate  reliably  in  near-real-time  on  imperfect  real-world  data;  (2) 
to  determine  the  uniqueness  of  the  solution  under  various  conditions; 
and  (3)  to  demonstrate  the  reconstruction  technique  with  real-world 
interferometer  data,  thereby  providing  images  with  finer  resolution 
than  would  ordinarily  be  possible. 
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RESEARCH  ACCOMPLISHMENTS 


An  overview  of  the  problem  of  image  reconstruction  for  stellar 
interferometry  is  given  in  Appendix  A.  It  serves  as  a  background 
for  the  following  description  of  the  research  accomplishments.  It 
is  reprinted  from  Reference  1. 

The  research  effort  can  be  divided  into  five  major  topics: 

1.  Stellar  speckle  interferometer  data  was  acquired,  evaluated, 

and  processed  into  imagery;  and  methods  were  developed  for 

minimizing  the  effects  of  the  types  of  noise  and  imperfec- 

2 

tions  found  in  that  data  . 

2.  Analytical  (and  to  a  lesser  extent  computer)  studies  of  the 

TAB 

uniqueness  problem  were  performed  * 

3.  A  new  method  was  developed  for  reconstructing  the  support  of 
an  object;  and  for  objects  consisting  of  a  collection  of 
point-like  sources,  a  new  noniterative  method  was  developed 

C  *7 

for  reconstructing  the  object  *  . 

4.  A  nuirtoer  of  variations  of  the  iterative  algorithm  were 

O  Q 

studies  and  compared  with  gradient  search  methods 

5.  Numerous  other  applications  of  the  iterative  algorithm  were 
identified^®. 

Publications  arising  from  this  research  effort  are  listed  as 
References  1-10  at  the  end  of  this  section. 

The  results  obtained  for  each  of  the  five  topics  listed  above 
are  summarized  in  the  five  respective  sections  that  follow. 
References  1,  2,  and  4-10  are  included  as  Appendices  A-I,  respec¬ 
tively.  For  the  sake  of  brevity.  Reference  3  is  not  repeated  here; 
it  was  included  in  Appendix  B  of  the  Interim  Report^ 
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2.1  PROCESSING  STELLAR  SPECKLE  INTERFEROMETRY  DATA 


Processing  experiments  were  performed  on  data  provided  by  the 
Steward  Observatory  Stellar  Speckle  Interferometry  Program  (via 
K.  Hege,  Steward  Observatory).  Methods  were  developed  to  compensate 
for  errors  in  the  data  in  order  to  arrive  at  a  good  estimate  of  the 
Fourier  modulus.  It  was  previously  known  that  it  is  necessary  to 
compensate  the  Fourier  modulus  data  for  a  certain  noise  bias  term 
due  to  photon  noise.  Using  the  Steward  Observatory  data,  it  was 
found  that  the  detection  process  resulted  in  a  frequency  transfer 
function,  which  we  call  the  detection  transfer  function,  which,  in 
addition  to  being  an  error  itself,  prevented  the  compensation  of  the 
noise  bias.  Methods  for  determining  the  detection  transfer  function 
from  the  data  and  compensating  for  it  were  developed.  Methods  for 
compensating  other  systematic  errors  were  also  deveoped.  These 
methods  were  applied  to  stellar  speckle  interferometry  data  of  a 
binary  star  system,  and  a  diffraction-limited  image  was  successfully 
reconstructed  from  the  resulting  compensated  Fourier  modulus  data. 
These  results  are  described  in  detail  in  Appendix  B. 

Having  gained  this  experience  with  single  and  binary  star  data, 
the  next  step  will  be  to  use  the  same  methods  on  more  complicated 
objects,  such  as  asteroids  or  Jovian  moons. 

Data  from  the  Anglo-Austral ian  Telescope  (via  J.C.  Dainty, 
University  of  Rochester)  in  the  form  of  many  short-exposure  images 
on  16  mm  movie  film  was  also  received.  The  16  mm  film  was  cut  into 
strips  and  contact  copied,  along  with  grey-scale  step  wedges,  onto 
9x9  inch  sheets  of  film.  The  9-inch  sheets  of  film  were  digitized 
at  the  Image  Processing  Institute  at  the  University  of  Southern 
California  using  an  Optronics  digitizer  set  at  25  micron  sample  spac¬ 
ing.  It  will  be  necessary  to  develop  software  for  extracting  the 
desired  information  from  the  large  (4000  x  8000  pixel)  digitized 
arrays.  No  further  work  was  done  on  this  data  because  efforts  were 


concentrated  on  the  Steward  Observatory  data  which  was  in  a  much  more 
convenient  form.  We  hope  to  process  this  data  in  the  future. 

2.2  UNIQUENESS  THEORY 

The  uniqueness  of  images  reconstructed  from  Fourier  modulus  in¬ 
formation  was  explored  using  the  theory  of  analytic  functions.  As 
described  in  Appendix  A  and  in  more  detail  in  Appendix  B  of  the 
Interim  Report^\  for  the  one-dimensional  case  there  are  usually 
many  different  objects  having  the  same  Fourier  modulus.  Examples  of 
both  unique  and  non-unique  cases  can  be  constructed.  Most  objects 
consisting  of  separated  parts  can  be  shown  to  be  uniquely  specified 
by  their  Fourier  modulus.  However,  the  separation  distance  is  more 
stringent  than  vnat  was  previously  thought,  as  shown  in  Appendix  C. 
It  has  been  shown  that  a  two-dimensional  object  having  radial  sym¬ 
metry  is  uniquely  specified  by  its  Fourier  modvlus,  as  discussed  in 
Appendix  D.  Also,  for  objects  consisting  of  collections  of  distinct 
points,  the  solution  is  usually  unique,  as  described  in  Appendix  F. 

It  was  found  that  when  cases  were  purposely  constructed  to  have 
two  different  solutions,  then  the  iterative  algorithm  would  converge 
to  one  of  the  solutions  in  about  half  of  the  trials  and  converge  to 
the  other  solution  in  the  other  half  of  the  trials,  depending  on  the 
random  number  sequence  used  as  thi  initial  input  to  the  algorithm. 
Therefore,  it  is  believed  that  in  general,  if  there  are  multiple 
solutions,  then  the  iterative  algorithm  is  likely  to  find  any  one  of 
them.  Our  experience  with  complicated  two-dimensional  objects  has 
been  that  the  algorithm  in  almost  all  cases  converged  to  the  original 
object  itself,  and  not  to  other  solutions.  This  is  a  strong  indica¬ 
tion  that,  in  practice,  other  solutions  usually  do  not  exist  in  the 
two-dimensional  case. 


2.3  NEW  NON-ITERATIVE  METHODS  FOR  SUPPORT  AND  OBJECT  RECONSTRUCTION 


A  new  method  was  developed  for  reconstructing  the  support  of  an 
object  (the  set  of  points  at  which  it  is  nonzero)  from  the  support 
of  its  autocorrelation.  In  some  instances,  for  example  to  find  the 
relative  locations  of  a  collection  of  point-like  stars,  the  object's 
support  is  the  desired  information.  More  generally,  once  the  ob¬ 
ject's  support  is  known,  then  the  complete  reconstruction  of  the 
object  by  the  iterative  method  is  simplified.  Several  methods  are 
shown  of  finding  sets  which  contain  all  possible  support  solutions. 
Particularly  small  and  informative  sets  containing  the  solutions  are 
given  by  the  intersections  of  two  translates  of  the  autocorrelation 
support.  For  the  special  case  of  convex  objects,  the  intersection 
of  three  translates  of  the  autocorrelation  support  generates  a  family 
of  solutions  to  the  support  of  the  object.  For  the  special  case  of 
an  object  consisting  of  a  collection  of  distinct  points  satisfying 
certain  nonredundancy  conditions,  the  intersection  of  three  trans¬ 
lates  of  the  autocorrelation  support  generates  a  unique  solution. 
In  addition,  for  these  same  objects,  by  taking  the  product  of  three 
translates  of  the  autocorrelation  function,  one  can  reconstruct  the 
object  itself  in  addition  to  reconstructing  the  support  of  the  ob¬ 
ject.  These  results  are  described  in  detail  in  Appendix  F  (which 
contains  further  results  and  proofs  not  included  in  the  Interim 
Report). 

2.4  ITERATIVE  ALGORITHM  VARIATIONS  AND  GRADIENT  SEARCH  METHODS 

Gradient  search  methods  for  solving  the  image  reconstruction 
(phase  retrieval)  problem  were  studied.  It  was  found  that  the 
steepest-descent  method  using  an  "optimum"  step  size  is  exactly 
equivalent  to  one  version  of  the  iterative  algorithm  called  the 
error-reduction  algorithm.  In  addition,  it  was  proven  that  the 
error-reduction  algorithm  converges  in  the  sense  that  after  each 
iteration,  the  error  of  the  output  image  is  less  than  or  equal  to 


the  error  at  the  previous  iteration.  However,  in  practice  the  error- 
reduction  algorithm  converges  very  slowly  for  this  application,  mak¬ 
ing  it  impractical.  Algorithms  that  converge  much  faster  than  the 
error-reduction  algorithm  and  that  are  practical  for  this  application 
are  the  input-output  family  of  iterative  Fourier  transform  algorithms 
and  the  conjugate  gradient  method.  The  algorithm  found  to  converge 
the  fastest  is  called  the  hybrid  input-output  algorithm.  A  partial 
reconstruction  of  the  object  is  available  after  just  a  few  itera¬ 
tions,  and  the  complete  reconstruction  takes  under  one  hundred  itera¬ 
tions,  which,  for  array  sizes  of  128  x  128  pixels,  takes  less  than 
two  minutes  on  a  Floating  Point  Systems  AP-120B  array  processor. 
The  gradient  search  methods  deserve  more  development;  the  computation 
of  the  gradient  can  be  accomplished  rapidly  using  the  method  of  two 
Fourier  transforms.  The  problem  of  phase  retrieval  from  two  inten¬ 
sity  measurements  was  also  considered  in  the  same  context.  These 
results  are  described  in  detail  in  Appendix  H. 

2.5  ADDITIONAL  APPLICATIONS  OF  THE  ITERATIVE  ALGORITHM 

In  addition  to  the  image  reconstruction  problem  for  the  fine- 
resolution  imaging  of  space  objects  through  atmospheric  turbulence, 
there  are  a  number  of  other  difficult  problems  which  can  be  solved 
by  algorithms  closely  related  to  the  iterative  Fourier  transform 
algorithm.  Such  algorithms  can  be  used  to  solve  problems  which  can 
be  stated  as  follows:  given  a  set  of  constraints  on  a  function  (an 
object,  wavefront,  or  a  signal)  and  another  set  of  constraints  on 
its  (Fourier)  transform,  find  a  transform  pair  that  satisfies  both 
sets  of  constraints.  Here,  the  word  “constraints"  is  meant  to  in¬ 
clude  both  measured  data  and  a  priori  information.  Several  applica¬ 
tions  falling  under  this  general  problem  statement  were  identified. 
They  include  image  reconstruction  (phase  retrieval  problems)  in 
astronomy,  electron  microscopy,  wavefront  sensing.  X-ray  crystallog¬ 
raphy  and  computed  topography,  and  the  synthesis  of  pupil  functions. 


computer-generated  holograms,  radar  signals,  digital  filters,  and 
other  applications.  A  summary  of  these  applications  is  given  in 
Appendix  I.  The  family  of  iterative  Fourier  transform  algorithms 
appears  to  be  an  extremely  valuable  mathematical  tool  which  can  be 
expected  to  receive  a  great  deal  of  attention  in  the  coming  years. 

2.6  SUMMARY  AND  CONCLUSIONS 

The  major  conclusion  of  this  research  effort  is  that,  for  a  wide 
set  of  objects,  the  reconstruction  of  fine-resolution  images  by  the 
combination  of  the  iterative  reconstruction  algorithm  with  stellar 
speckle  interferometry  data  appears  to  be  feasible.  Excellent  recon¬ 
struction  results  have  been  obtained  from  computer-simulated  data  of 
complicated  objects  and  from  real  telescope  data  of  a  simple  object. 
This  would  make  possible  a  fifty-fold  increase  in  resolution  for  an 
optical  telescope  of  five  meters  diameter.  Systematic  errors  in  the 
collection  of  the  data  can  be  detected  and  compensated,  although 
further  effort  is  required  to  demonstrate  these  techniques  on  com¬ 
plicated  objects.  For  complicated  two-dimensional  objects,  the 
Fourier  modulus  data  does  appear  to  uniquely  specify  the  object, 
although  a  definitive  theory  is  not  available  yet.  Special  cases 
that  have  been  analyzed,  including  objects  of  separated  support, 
objects  consisting  of  collections  of  discrete  points,  and  radially 
symmetric  objects,  were  shown  to  be  usually  unique.  A  new  nonitera¬ 
tive  method  for  support  and  object  reconstruction,  particularly  use¬ 
ful  for  objects  consisting  of  collections  of  discrete  points,  was 
invented.  A  better  understanding  of  the  iterative  algorithm  and  its 
relationship  to  gradient  search  methods  was  developed.  Finally,  it 
was  shown  that  algorithms  related  to  the  iterative  algorithm  are 
applicable  to  a  wide  variety  of  problems  and  deserve  further  develop¬ 
ment  for  those  applications  as  well. 
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IMAGE  RECONSTRUCTION  FOR  STELLAR  INTERFEROMETRY 

by 

J.  R.  Fienup 

Radar  and  Optics  Division 
Environmental  Research  Institute  of  Michigan 
P.O.Box  8618,  Ann  Arbor,  Michigan  48107  USA 

Interferometric  methods  (Michel son  &  Pease  1921,  Hanbury  Brown  & 
Twiss  1956,  Labeyrie  1970,  Gezari  et  al.  1972,  Currie  et  al.  1974) 
are  capable  of  measuring  the  modulus  of  the  Fourier  transform  (or 
equivalently,  the  autocorrelation)  of  an  astronomical  object  for 
resolution  out  to  the  diffraction  limit  of  the  telescope  aperture 
(0.02  seconds  of  arc  for  a  5-meter  aperture)  although  atmospheric 
turbulence  limits  the  measurement  of  the  Fourier  phase  to  a  resolu¬ 
tion  of  about  one  second  of  arc.  Except  for  the  special  cases  of 
objects  known  to  be  centro-symmetric  or  objects  having  a  neighboring 
unresolved  star  within  the  same  isoplanatic  patch  (Liu  &  Lohmann 
1973,  Weigelt  1980)  it  is  not  possible  to  directly  compute  the  ob¬ 
ject  from  its  Fourier  modulus. 


ITERATIVE  RECONSTRUCTION  METHOD 

Recently  an  iterative  method  (Fienup  1978,  Fienup  1979)  has  been 
successfully  used  to  reconstruct  an  object  from  its  Fourier  modulus 
(or  equivalently,  to  retrieve  the  Fourier  phase),  using  the  fact 
that  an  object's  brightness  spatial  distribution  is  nonnegative. 
The  method  works  for  any  arbitrary  object  in  one  or  more  dimensions. 
As  depicted  in  Figure  1,  the  algorithm  successively  Fourier  trans¬ 
forms  back  and  forth  between  the  object  domain  and  the  Fourier  do¬ 
main,  modifying  the  estimate  according  to  the  known  information  in 
each  domain.  The  iterations  are  usually  started  with  an  array  of 
random  nunbers  as  the  initial  estimate  of  the  object.  In  the 
Fourier  domain,  the  computed  Fourier  modulus  is  replaced  by  the 
measured  Fourier  modulus,  and  the  computed  phase  is  unaltered.  In 
the  object  domain  several  different  methods  for  picking  a  new  object 
estimate  have  proven  successful.  The  simplest  method  is  to  force 
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the  object  to  satisfy  the  nonnegativity  constraint,  i.e.,  set  it 
equal  to  zero  wherever  it  is  negative.  One  can  also  set  it  equal 
to  zero  outside  of  the  region  of  support  of  the  object  if  that  in¬ 
formation  is  known.  Although  it  can  be  shown  that  the  algorithm 
converges  when  using  this  simple  method  of  picking  a  new  estimate 
for  the  object,  there  are  also  more  sophisticated  methods  of  apply¬ 
ing  the  object-domain  constraints  which  in  practice  result  in  much 
faster  convergence  of  the  algorithm  (Fienup  1978,  Fienup  1979). 
The  algorithm  converges  to  a  Fourier  transform  pair  satisfying  the 
constraints  in  both  domains.  The  image  is  nonnegative  and  the 
modulus  of  its  Fourier  transform  equals  the  measured  Fourier 
modulus. 

Figure  2  shows  a  computer  experiment  testing  this  reconstruction 
method  on  a  realistic  simulation  of  the  Fourier  modulus  data  pro¬ 
vided  by  stellar  speckle  interferometry,  including  the  effects  of 
atmospheric  turbulence  and  photon  noise  (Feldkamp  &  Fienup  1980). 
An  undegraded  object,  a  digitized  photograph  of  a  satellite  shown 
in  Figure  2(a),  was  convolved  with  156  different  point-spread  func¬ 
tions  to  produce  156  different  blurred  images.  Each  of  the  point- 
spread  functions  represented  a  different  realization  of  the  effects 
of  the  turbulent  atmosphere.  The  blurred  images  were  then  subjected 
to  a  Poisson  noise  process  to  simulate  the  effects  of  photon  noise. 
Two  of  the  resulting  156  degraded  images  are  shown  in  Figures  2(b) 
and  2(c).  The  degraded  images  were  then  processed  by  Labeyrie's 
(1970)  method,  as  modified  by  Goodman  and  Belsher  (1976,  Dainty  & 
Greenway  1979).  The  estimate  of  the  modulus  of  the  Fourier  trans¬ 
form  of  the  object  is  given  by  (Feldkamp  4  Fienup  1980) 


where  Ik(UiV)  is  the  Fourier  transform  of  the  k^*’  degraded 
image,  is  the  total  number  of  photons  detected  (it  is  sub¬ 
tracted  in  order  to  compensate  for  a  certain  noise  bias  term  that 
arises  in  the  power  spectrum  due  to  photon  noise) (Goodman  4  Belsher 
1976),  S|((u,  v)  is  the  Fourier  transform  of  the  ki^i’  point- 
spread  function  (to  provide  compensation  for  the  MTF  of  the  speckle 
interferometry  process)  and  the  weighting  function  W(u,  v)  is  the 
MTF  due  to  the  telescope  aperture.  The  denominator  of  this  expres¬ 
sion  is  obtained  by  making  measurements  on  a  reference  star  through 
an  atmosphere  having  the  same  statistics  as  that  which  blurred  the 
images  or  by  using  a  model  of  the  effects  of  atmospheric  turbulence 
(Korff  et  al.  1972).  W(u,  v)  was  included  in  order  to  restore  the 
natural  MTF  due  to  the  telescope  aperture  which  was  removed  by  the 
denominator  of  the  equation  above.  Figure  2(d)  shows  the  resulting 
Fourier  modulus  estimate.  In  a  realistic  situation  one  would  also 


have  to  compensate  for  the  MTF  of  the  detection  process  (Aime  et 
al.  1979,  Fienup  &  Feldkamp  1980).  The  Fourier  modulus  estimate 
was  processed  by  the  iterative  method,  resulting  in  the  recon¬ 
structed  image  shown  in  Figure  2(e),  which  agrees  very  well  with 
the  original  object.  This  result  was  obtained  assuming  a  telescope 
diameter  of  1.2  meters  and  300,000  photons  per  degraded  image,  which 
is  realistic  for  extended  objects  of  this  type. 


UNIQUENESS 

Even  if  one  does  obtain  an  image  that  is  nonnegative  and  has  the 
correct  Fourier  modulus,  the  question  remains:  is  the  solution 
unique?  For  the  one-dimensional  case  it  is  well  known  from  the 
theory  of  analytic  functions  that  there  are  ordinarily  an  enormous 
number  of  different  solutions.  (Translations  and  mirror  images  of 
the  object  all  have  the  same  Fourier  modulus  and  are  not  considered 
to  be  different  solutions.)  If  the  analytic  continuation  of  the 
Fourier  transform  off  the  real  line  has  M  non-real  zeroes,  then 
there  are  2^"'  different  solutions  (Walther  1963,  Hofstetter 
1964).  Imposing  the  condition  of  nonnegativity  may  greatly  reduce 
the  number  of  solutions  (Bates  1969),  but  a  theory  has  not  yet  been 
developed  that  indicates  how  many  nonnegative  solutions  there  would 
be. 

For  some  special  classes  of  one-dimensional  objects  the  solution 
is  usually  unique.  One  such  class  is  that  of  objects  of  separated 
support,  (Greenaway  1977,  Bates  1978)  providing  that  the  parts  of 
the  object  are  separated  by  a  sufficient  distance  (Crimmins  &  Fienup 
1981).  Another  such  class  is  that  of  objects  consisting  of  collec¬ 
tions  of  discrete  points  (like  stars)  randomly  distributed  within 
some  finite  region  of  space.  Unless  the  spacings  of  the  discrete 
points  satisfy  certain  redundancy-type  conditions,  not  only  is  the 
solution  unique,  but  also  there  exists  a  very  simple  noniterative 
reconstruction  method,  which  involves  the  product  of  three  trans¬ 
lates  of  the  object's  autocorrelation  function  (Fienup  et  al.  1981). 
This  reconstruction  method  also  works  for  the  case  of  two  or  more 
dimensions. 

The  uniqueness  situation  appears  to  be  much  different  in  two 
dimensions  than  in  one  dimension.  Whereas  in  one  dimension  unique¬ 
ness  is  unusual,  in  two  (or  more)  dimensions  uniqueness  appears  to 
be  the  rule,  and  nonuniqueness  is  unusual.  The  difference  between 
one  and  two  dimensions  can  be  understood  from  the  results  of  Bruck 
and  Sodin  (1979).  They  considered  the  special  case  of  a  sampled 
object,  one  consisting  of  a  finite  array  of  delta  functions  (each 
having  a  different  amplitude)  on  a  rectangular  lattice.  The  Fourier 
transform  of  such  an  object  can  easily  be  shown  to  be  a  polynomial. 
It  can  also  be  shown  that  the  number  of  different  solutions  having 
the  same  Fourier  modulus  is  2^“'  where  M  is  the  number  of  irre¬ 
ducible  factors  into  which  the  polynomial  can  be  factored  (not  all 
of  these  solutions  are  necessarily  nonnegative).  Since  a  one¬ 
dimensional  polynomial  of  order  M  can  always  be  factored  into  M 
irreducible  factors  (the  roots  of  the  polynomials  can  be  related  to 
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Figure  1.  Block  Diagram  of  the  Iterative 
Method  for  Reconstructing  an  Object  from 
the  Modulus  of  Its  Fourier  Transform. 


Figure  2.  Reconstruction  Experiment  for  Stellar  Speckle  Interfer¬ 
ometry.  (A)  Undegraded  Object;  (8),  (C)  Simulated  Images  Degraded 
by  Atmospheric  Turbulence  and  Photon  Noise;  (0)  Fourier  Modulus 
Estimate;  (E)  Image  Reconstructed  from  Fourier  Modulus. 


the  complex  zeroes  mentioned  earlier),  the  number  of  possible  solu¬ 
tions  is  2*^“'  in  the  one-dimensional  case.  On  the  other  hand, 
two  dimensional  polynomials  having  arbitrary  coefficients  are  only 
rarely  factorable.  Consequently,  the  two-dimensional  case  is  usu¬ 
ally  unique  although  the  one-dimensional  case  is  usually  not 
unique.  Efforts  have  also  been  made  to  extend  these  results  to  the 
case  of  continuous  objects  (Lawton  1980). 

Of  course,  one  can  always  invent  two-dimensional  examples  that 
are  not  unique.  One  way  to  do  this  is  to  form  an  object  as  the 
convolution  of  one  nonnegative  function  with  a  second  nonnegative 
function.  In  the  Fourier  domain,  one  is  multiplying  the  Fourier 
transforms  of  the  two  functions.  A  second  solution  is  then  given 
by  the  convolution  of  the  first  function  with  the  mirror  image  of 
the  second  function.  In  the  Fourier  domain,  one  has  the  product  of 
the  Fourier  transform  of  the  first  function  with  the  complex  conju¬ 
gate  of  the  Fourier  transform  of  the  second  function.  Since  the 
complex  conjugation  does  not  change  the  Fourier  modulus,  both  solu¬ 
tions  have  the  same  Fourier  modulus.  Another  method  of  generating 
cases  having  multiple  solutions  is  given  by  Huiser  and  vanToorn 
(1980).  Despite  the  fact  that  nonunique  cases  can  be  constructed, 
it  appears  to  be  true  that  for  most  two-dimensional  objects  taken 
from  the  real  world  the  solution  will  be  unique.  Experimental  evi¬ 
dence  upholds  this  view;  for  a  variety  of  objects  reconstructed  by 
the  iterative  method  so  far,  the  two-dimensional  reconstructed  im¬ 
ages  usually  differ  very  little  from  the  original  object  (Fienup 
1979).  Furthermore,  the  presence  of  noise  in  the  Fourier  modulus 
data  has  had  the  effect  of  causing  the  reconstructed  image  to  be 
noisy  rather  than  having  it  converge  to  an  essentially  different 
solution  (Feldkamp  &  Fienup  1980). 

In  those  cases  for  which  there  are  multiple  solutions,  however, 
then  the  iterative  algorithm  converges  to  different  members  of  the 
set  of  solutions,  depending  on  the  random  numbers  used  to  initialize 
the  algorithm.  Therefore  by  multiple  trials  of  the  algorithm  one 
can  find  a  number  of  different  solutions  if  they  exist.  If  there 
is  only  one  solution,  then  the  algorithm  will  converge  only  to  that 
solution. 


ALTERNATIVE  METHODS 

Although  the  iterative  algorithm  appears  to  be  the  only  practical 
reconstruction  method  for  general  two-dimensional  objects,  there 
are  some  other  methods  which  may  be  useful  in  certain  cases.  For 
collections  of  point-like  stars,  the  method  of  using  the  product  of 
three  autocorrelations  (Fienup  et  al.  1981)  would  usually  work  well, 
as  would  the  iterative  method  of  Baldwin  and  Warner  (1978)  which  is 
the  same  as  one  used  in  X-ray  crystallography.  When  only  a  small 
number  of  resolvable  elements  is  involved,  then  an  iterative  Newton- 
Raphson  method  is  practical  (Frieden  S  Currie  1976).  If  the  signal- 
to-noise  ratio  is  high  enough,  then  the  Napier  and  Bates  (1974) 
method  of  tracking  the  complex  zeroes  of  projections  of  the  image 
may  be  feasible. 


If  one  utilizes  all  the  individual  short-exposure  images  in 
stellar  speckle  interferometry,  instead  of  trying  to  reconstruct 
the  image  from  the  Fourier  modulus  only,  then  additional  recon¬ 
struction  methods  are  possible.  By  the  Knox-Thompson  (1974, 
Stachnik  et  al .  1977)  technique,  one  averages  over  phase  differences 
between  adjoining  points  in  the  Fourier  transforms  of  the  short- 
exposure  images,  then  integrates  over  the  averaged  phase  differences 
to  arrive  at  an  estimate  of  the  Fourier  phase.  When  the  image  in¬ 
cludes  one  star  that  is  considerably  brighter  than  the  others,  then 
the  shift-and-add  method  of  Bates  and  Cady  (1980,  Cady  s  Bates  1980) 
should  work  well.  When  the  diameter  of  the  object  is  only  a  couple 
of  times  the  size  of  a  diffraction-limited  resolution  element,  then 
the  shift-and-add  method  of  Worden,  Lynds  and  Harvey  (1976)  may  be 
useful . 


CONCLUSIONS 

From  both  theoretical  and  experimental  viewpoints,  it  appears 
that  the  reconstruction  of  diffraction-1 imited  images  of  astronomi¬ 
cal  objects  should  be  feasible  using  stellar  speckle  interferometry 
combined  with  the  iterative  algorithm.  The  iterative  algorithm 
converges  fast  enough  to  make  it  practical  to  reconstruct  images  of 
complicated  two-dimensional  objects,  and  the  solution  has  been  found 
to  be  usually  unique  for  the  two-dimensional  case.  The  reconstruc¬ 
tion  method  is  not  overly  sensitive  to  noise,  making  it  practical 
for  reconstructing  images  from  real-world  telescope  data. 
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Abstract 


Diffraction-limited  images,  of  resolution  many  times  finer  than  what  is  ordinarily  ob¬ 
tainable  through  large  earth-bound  telescopes,  can  be  obtained  by  first  measuring  the  modu¬ 
lus  of  the  Fourier  transform  of  an  object  by  the  method  of  Labeyrie's  stellar  speckle 
interferometry,  and  then  reconstructing  the  object  by  an  iterative  method.  Before  recon¬ 
struction  is  performeo,  it  is  first  necessary  to  compensate  for  weighting  functions  and 
noise  in  order  to  arrive  at  a  good  estimate  of  the  object’s  Fourier  modulus.  A  simple  al¬ 
ternative  to  Worden's  method  of  compensation  for  the  MTF  of  the  speckle  process  is  de¬ 
scribed.  Experimental  reconstruction  results  are  shown  for  the  binary  star  system  SAD 
9A163. 


Introduction 

As  discussed  in  several  papers  in  this  session  on  Stellar  Speckle  Interferometry, 
Labeyrie’s  method^  of  processing  many  short-exposure  Images  can  be  used  to  arrive  at  a 
di f fract ion-i imited  estimate  of  the  Fourier  modulus  of  an  astronomical  object  despite  the 
presence  of  atmospheric  turbulence.  Since  tne  diffraction  limit  of  a  iarge-aperture  tele¬ 
scope  is  many  times  finer  than  the  resolution  ordinarily  obtainable  through  the  atmosphere 
at  optical  wavelengths,  stellar  speckle  interferometry  has  the  potential  for  providing 
images  having  many  times  finer  detail  than  what  is  ordinarily  obtainable  from  earth-bound 
telescopes.  Unfortunately,  except  for  special  cases  in  which  an  unresolved  star  is  very 
near  the  object  of  interest^,  the  Fourier  moduius  data  can  be  used  to  directly  compute 
only  the  autocorrelation  of  the  object  and  not  the  object  itself.  In  recent  years,  it  has 
been  shown  that  this  stumbling  block  can  be  overcome  by  an  iterative  method^  of  computing 
the  object’s  spatial  (or  angular)  brightness  distribution,  which  uses  the  Fourier  modulus 
data  provided  by  stellar  speckle  interferometry  combined  w.th  the  a  priori  knowledge  that 
the  object  distribution  is  nonnegative.  This  method  provides  an  alternative  to  other  fine- 
resolution  imaging  techniques* > 5 . 

In  the  remainder  of  this  paper,  stellar  speckle  inteferometry  and  the  iterative  recon- 
struction  method  are  briefly  reviewed.  Then  more  detailed  discussions  of  noise  terms  and 
MTF  factors  present  in  speckle  interferometry  are  given,  and  methods  of  obtaining  an  im¬ 
proved  estimate  of  an  astronomical  object’s  Fourier  modulus  are  described.  Finally,  some 
recent  results  obtained  with  telescope  data  are  shown. 

Basic  stellar  speckle  interferometry 

Labeyrie’s  stellar  speckle  Interferometry  starts  by  taking  a  number  of  short-exposure 
images  of  an  astronomical  object; 

d„(x)  =  f{x)  *  s„(x)  (1) 

fn  m 

where  f(x)  is  the  spatial  or  angular  brightness  distribution  of  the  object  and  Sn,(x)  is 
the  point-spread  function  due  to  the  combined  effects  of  atmosphere  and  the  telescope  for 
the  m^l^  exposure.  The  coordinate  x  is  a  two-dimensional  vector  and  ^  denotes  convolu¬ 
tion.  It  is  assumed  that  the  exposure  time  is  short  enough  to  "freeze"  the  atmosphere  and 
only  a  narrow  spectral  band  is  used.  The  Fourier  transform  of  each  short-exposure  image 
is  computed; 

D„{u)  =  /  d_{x)  exp  (i2irux)  dx  (2) 

I"  J-. 

In  this  paper,  capital  letters  will  denote  the  complex  Fourier  transforms  of  the  corre¬ 
sponding  lower-case  letters,  and  the  coordinate  u  is  referred  to  as  a  spatial  frequency. 

The  summed  squared  Fourier  modulus  (the  summed  power  spectrum)  is  computed: 

MM  M 

Z  =  Z  |F(u)S^(u)l^  =  iFIull^J]  |S„(u)l2  (3) 

m=l  m=i  m=i 
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Tnt  f3ctor  Z-|S^(u)|-  can  be  thought  of  as  the  square  of  the  MTF  of  the  speckle  interfer- 
oi"':' t ry  process  (the  speckle  mTF^)  and  it  can  be  determined  approximately  by  performing 
stellar  specKle  interferometry  on  an  isolated  unresolved  star  through  atmospheric  condi¬ 
tions  naving  tlie  same  statistics  as  those  througti  «hich  the  object  imagery  is  taken. 
Dividing  the  summed  power  pectrum  by  the  speckle  MTF^  yields,  according  to  Eq.  (1), 
lE(u)|-,  tlie  squared  Fourier  modulus  of  the  object. 

Since  it  is  simply  the  Fourier  transform  of  |F(u)l2,  the  autocorrelation  of  the  object 
can  be  obtained  by  Labeyrie's  method.  However,  the  autocorrelation  gives  only  very  limited 
information  about  the  object:  its  diameter,  and  the  separation  for  the  case  of  a  binary 
star  system.  Only  for  the  special  cases  of  (1)  an  object  known  to  be  centro-symmetric  and 
(2)  an  object  having  an  isolated  unresolved  star  within  the  same  isoplanatic  patch^ 

(within  a  few  seconds  of  arc)  can  the  autocorrelation,  or  equivalently  |F(u)|^,  be  used 
to  directly  compute  the  object. 

The  iterative  method 

We  have,  in  addition  to  the  measured  Fourier  modulus,  the  a  p'iori  knowledge  that  the 
object  brightness  is  a  real,  nonnegative  function.  The  reconstruction  problem  consists  of 
finding  a  nonnegative  object  that  is  consistent  with  the  measured  Fourier  modulus  data. 

This  problem  can  be  solved  by  the  iterative  method  depicted  in  Figure  1.  It  consists  of 


Figure  1.  Iterative  processing  overview. 

four  steps:  (1)  an  initial  estimate  of  the  object,  g(x)  (which  we  usually  choose  to  be  a 
field  of  random  numbers),  is  Fourier  transformed;  (2)  in  the  Fourier  domain,  the  measured 
Fourier  modulus  is  substituted  for  the  computed  Fourier  modulus,  and  the  computed  phase  is 
unaltered;  (3)  the  result  is  inverse  Fourier  transformed,  yielding  an  image  g’(x);  and  (4) 
a  new  g(x)  is  chosen,  based  on  the  violation  of  the  object-domain  constraints  by  g'(x). 

The  four  steps  are  repeated  until  the  mean-squared  error  is  reduced  to  a  small  value  con¬ 
sistent  with  the  signal-to-noise  ratio  of  the  measured  Fourier  modulus  data.  The  mean- 
squared  error  in  the  image  domain  is 


where  the  region  t  includes  all  points  at  which  g’(x)  violates  the  object  domain  con¬ 
straints  (where  it  is  negative  or  possibly  where  it  exceeds  an  a  priori  known  diameter). 
Several  different  methods  for  choosing  a  new  g(x)  have  proven  successful.  For  the  results 
shown  in  this  paper,  we  used  for  most  iterations 

’gi’(x)  ,  X 

Qk^lCx)  =  ' 

g^(x)  -  -Sg^^lx) ,  X  CY 
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interspersing  with 


g^(x),  X  iir 


X  Elf 


(6) 


overy  fev»  iterations,  where  the  subscript  •<  refers  to  the  k^h  iteration  and  y  is  oefined 
as  in  Eq.  (4).  More  detailed  discussions  of  the  iterative  method  and  why  it  works  can  be 
founo  in  fieferences  3  sno 


For  the  binary  star  resuits  shown  iater,  when  random  numbers  were  used  for  the  initial 
input,  then  over  a  hundred  iterations  were  required  for  an  array  size  of  128  x  128  pixels 
(about  the  same  number  of  iterations  that  has  been  required  for  complicated  two-dimensional 
objects),  taking  about  two  minutes  on  a  Floating  Point  Systems  AP  1208  array  processor. 

When  a  binary  star  pattern  witn  the  correct  spacing  (which  can  be  determined  from  the  auto¬ 
correlation)  but  the  incorrect  brightness  ratio  was  used  as  the  initial  input,  then  only  a 
dozen  iterations  were  required  for  convergence,  taking  about  10  seconds.  It  was  found  that 
the  nonnegativity  constraint  was  sufficient,  and  the  diameter  constraint  was  not  needed. 

Noise  and  MTF  characteristics  and  their  compensation 

The  data  used  for  the  experiments  described  here  were  obtained  from  the  Steward  Observ¬ 
atory  Speckle  Interferometry  Program  which  is  described  in  more  detail  elsewhere  in  this 
proceedings  volume^^  Por  this  "event  detection"  data,  it  is  assumed  that  any  one  short 
exposure  image  contains  no  more  than  one  photon  in  any  one  pixel  (and  most  pixels  record 
zero  photons).  After  an  image  is  magnified,  intensified,  and  detected,  (among  other 
things)  it  is  thresholoed  to  produce  an  image  consisting  of  ones  (where  above  the  thresh¬ 
old)  and  zeros.  Each  image  is  autocorrelated,  and  the  sum  of  ail  the  autocorrelations  is 
computed.  The  summed  power  spectrum  is  computed  as  the  Fourier  transform  of  the  summed 
autocorrelation.  In  addition,  each  image  is  centrolded  (translated  to  make  their  centroids 
coincident)  to  within  the  nearest  pixel,  and  the  sum  of  the  centrolded  images  is  computed. 

The  telescope  diameter  is  2.3  meters;  and,  for  30X  magnification  of  the  image,  the  image 
scale  is  approximately  0,02  arc-sec  per  pixel  along  each  line,  and  is  about  0.017  arc-sec 
per  line  (it  is  stretched  by  about  13*  in  that  dimension  relative  to  the  along-llne  dimen¬ 
sion),  For  6QX  magnification,  the  scale  is  half  that.  The  data  is  digitized  in  256  x  256 
arrays . 

Figure  2  shows  an  example  of  a  cut  through  the  summed  power  spectrum  of  an  unresolved 
star,  SAO  36615,  This  data  was  taken  at  60X  magnification  (0.01  arc-sec  per  pixel  =  A. 7  x 
10*®  raolans  per  pixel)  at  a  30  nm  wavelength  band  centered  at  750  nm.  The  scale  in  the 
Fourier  domain  is  750  nm/(4.7  x  10*®  rad  x  256)  =  0.062  meters  (of  telescope  aperture) 
per  pixel.  For  a  telescope  diameter  of  2.3  meters,  the  highest  spatial  frequency  passed 
by  the  telescope  aperture  is  2.3  m/(0.062  m/pix)  =  37  pixels  from  zero  frequency.  Ideally 
(no  atmosphere  or  aberrations  and  no  noise),  the  summed  power  spectrum  of  an  unresolved 
star  would  be  the  square  of  the  mtf  due  to  the  telescope  aperture  (that  MTF  Is  the  autocor¬ 
relation  of  the  telescope  pupil  function).  Assuming  a  circular  aperture,  a  cut  through 
the  telescope  aperture  MTF  would  have  a  roughly  cone  shape®  and  be  zero  beyond  pixel  37. 
However,  the  summed  power  spectrum  of  the  unresolved  star  shown  in  Figure  2  is  very  far 
from  this  ideal. 

Two  effects  dominate  the  shape  of  the  power  spectrum.  First,  the  speckle  MTF^,  men¬ 
tioned  earlier  in  connection  with  Eq.  (3),  drops  very  rapidly  for  the  very  low  spatial  fre¬ 
quencies  near  the  atmospheric  cut-off.  This  results  in  the  spike-like  behavior  of  the 
summed  power  spectrum  for  very  low  spatial  frequencies.  Beyond  the  very  low  spatial- 
frequency  region,  the  speckle  MTF^  is  much  better  behaved  and  decreases  slowly^.  Second, 
photon  noise  results  in,  among  other  things,  a  noise  bias  term  in  the  summed  power  spec- 
trum^®.  This  noise  bias  term  dominates  in  the  higher  spatial  frequencies.  Beyond  a 
radius  of  37  pixels  in  the  summed  power  spectrum,  no  signal  energy  exists  --  it  is  purely 
noise.  They  so  dominate  the  summed  power  spectrum  that  iittie  useful  information  can  be 
obtained  unless  compensation  is  made  for  both  of  these  two  effects. 

The  noise  bias  term  and  the  detection  transfer  function 


One  would  ordinarily  eliminate  the  noise  bias  term  simply  by  subtracting  a  constant  from 
the  summed  power  spectrumiOf .  However,  as  seen  from  Figure  2,  the  noise  bias  term, 
which  is  seen  by  itself  beyond  pixel  37,  is  not  a  constant  in  this  case.  This  results  from 
the  fact  that  upon  detection  and  thresholding,  a  single  photon  sometimes  results  in  more 
than  one  pixel  recording  a  one,  depending  upon  the  threshold  level  and  the  size  of  the 
splotch  of  light  exiting  from  the  image  intenslfier.  Table  1  shows  the  autocorrelations 
and  the  individual  squared  transfer  functions  of  some  of  the  various  patterns  of  ones  re¬ 
sulting  from  a  single  photon.  Each  pattern  is,  in  effect,  the  Impulse  response  of  the  de- 
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Table  1.  Event  detection  data:  individual 
impulse  responses,  their  auto  correlations, 
and  their  power  spectra. 
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Figure  2,  Summed  power  spectrum  of  an  un¬ 
resolved  star  (linear  scale). 

The  middle  and  upper  curves  are 
the  same  as  the  lower  curve,  ex¬ 
cept  have  lOX  and  lOOX  vertical 
scales,  respectively. 
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tection  system;  and  in  any  one  Image,  several  different  patterns  may  appear.  That  is,  this 
impulse  response  may  vary  from  photon  to  photon  within  the  same  image.  Assuming  a  sparse 
popuiation  of  photons  within  each  image,  it  can  be  shown  that  the  net  squared  transfer 
function,  due  to  the  ensemble  of  photon-produced  patterns  within  an  image,  is  given  by  a 
weighted  sum  of  the  individual  squared  transfer  functions  of  the  individuai  patterns.  We 
refer  to  this  weighted  sum  as  the  detection  transfer  function  squared  (DTF^), 

One  can  compensate  for  the  noise  bias  term  by  the  foiiowing  steps^^,  (i)  Over  the 
spatiai  frequencies  above  the  teiescope  cut-off,  perform  a  two-dimensionai  ieast-squares 
fit  of  a  weighted  sum  of  individuai  squared  transfer  functions  (some  of  which  are  shown  in 
Tabie  i)  to  the  summed  power  spectrum.  By  this,  the  DTF2  is  determined.  (2)  Compensate 
the  effects  of  the  DTf2  jjy  dividing  the  summed  power  spectrum  by  the  DTf2  (for  aii  spa¬ 
tiai  frequencies).  By  this,  the  noise  bias  term  is  made  a  constant.  (3)  Subtract  from 
the  DTF2_compensated  summed  power  spectrum  the  constant  noise  bias  term.  This  DTf2 
and  noise  bias  compensation  are  demonstrated  in  Figures  3  and  4  for  the  binary  star  system 
SAO  94163.  In  this  case,  the  magnification  was  30X  and  the  waveiength  was  750  nm  (iO  nm 
spectrai  bandwidth)  and  so  the  teiescope  cut-off  is  at  a  spatiai  frequency  of  74  pixeis. 
This  data  set  resulted  from  power-spectrum  averaging  of  i820  short  exposure  images  contain¬ 
ing  a  totai  of  about  2.4  x  i05  photons. 


In  the  autocorrelation  domain,  the  noise  bias  term  results  in  a  spike  at  the  (0,  0) 
coordinate,  and  the  0Tf2  causes  the  spike  to  be  spread  over  a  few  pixeis  about  (0,  0). 
Compensation  for  the  DTf2  causes  the  spike  to  coiiapse  to  a  deita-function  at  (0,  0). 

Then  the  subtraction  of  the  noise  bias  in  the  Fourier  domain  removes  the  deita-function  at 
(0,  0)  in  the  autocorreiation. 

More  generaiiy,  the  functionai  form  of  the  DTf2  is  heaviiy  dependent  on  the  manner  in 
which  the  images  are  detected  and  shouid  be  modified  according  to  the  characteristics  of 
the  detection  hardware  used. 

The  speckie  MTF^ 


Compenstion  for  the  speckie  MTF2  wouid  ordinariiy  be  accompiished  by  dividing  the 
summed  power  spectrum  by  the  summed  power  spectrum  of  a  reference  star!.  Both  power 
spectra  should  first  be  corrected  for  the  0Tf2  and  the  noise  bias  term. 


PIXELS 


Figure  3.  Power  spectrum  of  the  binary  SAO 
94163.  (A)  Middle  curve;  raw  sum¬ 
med  power  spectrum;  (B)  upper 
curve:  sunned  power  spectrum  di¬ 
vided  by  the  DTF^;  (C)  lower 
curve;  DTF2  -  compensated  summed 
power  spectrum  with  noise  bias 


0  ^ 


Figure  4.  (A) -(C)  Two  dimensional  view  of 

Figure  3(A)-(C) ; (D)the  Fourier 
modulus,  i.e.,  the  square  root 
of  (C) .  Note:  the  residual 
noise  beyond  the  telescope  cut¬ 
off  frequency  is  visible  in 
this  case  and  not  in  (C)  because 
the  square  root  operation  re¬ 
duces  the  dynamic  range  of  the 


subtracted. 


data. 


In  some  Instances,  there  Is  not  available  the  summed  power  spectrum  of  a  reference  star 

close  enough  in  both  space  and  time  to  the  observation  of  the  object.  In  that  case,  one 

approach  is  to  approximate  the  actual  speckle  mtf2  tjy  fitting  a  model^  of  the  speckle  mtf2 

to  the  data.  Another  alternative  is  the  Worden  subtract  method^^.  There  seems  to  be  some 

controversy  over  the  effectiveness  of  this  method^*;  there  is  no  doubt  that  it  provides  a 
much  better  estimate  of  the  object’s  power  spectrum  than  the  summed  power  spectrum,  but 
further  investigation  is  needed  to  determine  whether  it  is  the  best  available  estimate. 

The  Worden  subtract  method  of  compensating  for  the  speckle  MTF^  consists  of  subtracting 
from  the  summed  autocorrelation  the  sum  of  cross-correlations  of  different  short -exposure 
images.  The  short-exposure  images  must  be  centroided  before  being  cross-correlatec^^. 

It  is  easily  shown  that  subtraction  of  a  properly  scaled  version  of  the  power  spectrum  of 
the  sum  of  the  centroided  images  from  the  summed  power  spectrum  is  exactly  equivalent  to 
the  Worden  subtract  method.  Figure  5(a)  shows  a  cut  through  these  two  functions  for  very 
low  spatial  frequencies  for  the  unresolved  star  SAO  36615.  Both  are  similar  in  their 
spike-like  behavior  for  very  low  spatial  frequencies.  However,  for  spatial  frequencies  at 
four  pixels  from  zero  frequency  and  beyond,  the  power  spectrum  of  the  sum  of  the  centroided 
images  is  essentially  zero.  Thus,  it  would  appear  that  the  Worden  subtract  method  could 
correct  only  for  the  verv  lowest  spatial  frequencies.  This  is  borne  out  in  Figures  5(b) 
and  5(c).  Furthermore,  recalling  that  the  compensated  summed  power  spectrum  for  an  unre¬ 
solved  star  should  be  a  constant  (weighted  by  the  mtf2  of  the  telescope  aperture  which 
is  nearly  unity  for  these  very  low  spatial  frequencies),  we  see  from  Figures  5(b)  and  (c) 
that  the  Worden  subtract  method  did  not  produce  the  expected  result.  By  getting  rid  of  most 
of  the  low-frequency  spike  in  the  summed  power  spectrum,  the  Worden  subtract  method  did 
greatly  decrease  its  mean-squared  error;  however,  it  did  not  replace  the  spike  with  the 
correct  low-frequency  information.  It  is  not  presently  known  whether  this  apparent  inade¬ 
quacy  of  the  Worden  subtract  method  is  due  to  basic  inadequacies  of  the  method  itself  or 
due  to  problems  with  this  particular  data  set;  however,  it  is  consistent  with  the  analysis 
of  Fante^*. 

A  better  speckle  MTF^  compensation  than  the  werden  subtract  method  in  this  case  would 
simply  be  to  clip  the  summed  power  spectrum  at  the  very  low  spatial  frequencies.  That  is, 
assuming  that  the  object's  power  spectrum  is  nearly  constant  tor  the  very  low  spatial  fre- 
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Figure  S.  Worden  subtract  method  on  an  unresolved  star  (low  apatial 

frequencies).  (A)  upper  curve:  sxmmed  power  spectrum;  low¬ 
er  curve:  power  spectrum  of  the  sum  of  the  centroided  im¬ 
ages;  (B)  upper  curve:  summed  power  spectrum  (note  expanded 
vertical  scale);  lower  curve;  summed  power  spectrum  minus 
the  power  spectrum  of  the  aum  of  the  centroided  ipiages; 

(C)  same  aa  (B) ,  except  a  smaller  percentage  of  the  power 
spectrum  of  the  sum  of  the  centroided  images  was  subtracted. 

quencles  (which  would  be  true  for  objects  of  diameter  only  a  small  fraction  of  an  arc-sec), 
we  replace  the  summed  power  spectrum  in  that  region  by  a  constant.  The  constant  is  chosen 
to  be  consistent  with  the  value  of  the  summed  power  spectrum  in  the  region  just  beyond  the 
very  low-frequency  spike.  As  in  this  case  of  the  Worden  subtract  method,  this  method  does 
not  correct  for  the  middle-frequency  vs.  higher-frequency  regions  of  the  speckle  MTF^; 
however,  as  noted  earlier,  the  speckle  MTF^  is  reasonably  well  behaved  for  those  spatial 
frequencies,  and  correcting  for  the  very  low  spatial  frequencies  corrects  for  the  greatest 
part  of  the  error. 

The  method  of  clipping  the  summed  power  spectrum  to  correct  for  the  speckle  MTF^  is 
demonstrated  in  Figure  6  for  the  binary  SAO  94163  for  which  reference  star  data  was  not 
available.  In  order  to  Increase  the  accuracy  of  the  assumption  that  the  Fourier  modulus 
(or  its  square,  the  power  spectrum)  is  constant  for  very  low  spatial  frequencies,  the  OTF 
and  noise-bias-corrected  Fourier  modulus  was  divided  by  the  MTF  due  to  the  telescope  aper¬ 
ture  (which  was  approximated  by  the  MTF  due  to  a  circular  aperture  of  diameter  2.3  meters). 
The  elliptical  shape  of  the  Fourier  modulus  data  is  due  to  the  difference  in  scale  factors 
in  the  two  dimensions  as  noted  earlier.  Within  the  low  frequency  region,  wherever  the 
Fourier  modulus  exceeded  a  threshold  value,  it  was  clipped  to  that  threshold  value.  The 
result  was  multiplied  by  the  MTF  due  to  the  telescope  to  arrive  at  our  final  estimate  of 
the  Fourier  modulus  of  SAO  96163  Including  the  telescope  MTF.  In  the  process  of  multiply¬ 
ing  back  in  the  telescope  MTF,  the  residual  noise  beyond  the  telescope  cut-off  frequency 
was  set  to  zero. 

Imaqe  reconstruction  results 

The  Fourier  modulus  estimate  shown  in  Figure  6(d)  was  truncated  to  a  128  x  126  array, 
in  order  to  save  computation  time  in  the  iterative  reconstruction.  This  caused  a  slight 
truncation  of  the  highest  spatial  frequencies  along  the  horizontal  dimension  of  Fig¬ 
ure  6(d).  SAO  94163  was  reconstructed  using  the  iterative  method,  and  the  images  resulting 
from  two  different  selections  of  the  initial  input  to  the  algorithm  are  shown  in  Fig¬ 
ures  7U)  and  (b),  respectively.  The  rms  error  Eq  was  reduced  to  about  0.05.  For  the 
purpose  of  display,  a  (sin  x)/x  Interpolation  was  performed  on  the  images  of  Figure  7  in 
order  to  increase  the  sampling  rate  across  the  image.  In  order  to  get  an  indication  of 
the  sensitivity  of  the  method  to  the  clipping  threshold  level  described  in  the  previous 
section,  the  clipping  was  done  over  again  using  a  33%  greater  threshold  value  (which  is 
obviously  greater  than  the  optimum  threshold).  Two  images  reconstructed  from  the  resulting 
Fourier  modulus  estimate  are  shown  in  Figure  7(c)  and  (d).  Half  the  time,  the  iterative 
reconstruction  algorithm  produces  an  image  rotated  by  180®  due  to  the  inherent  2-fold  am¬ 
biguity  of  the  Fourier  modulus  data. 
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Figure  6.  Clipping  to  compensate  for  the 
speckle  MTF  (for  "seeing")  for 
the  binary  SAO  94163.  (A)  Four¬ 
ier  modulus,  same  as  Figure  4(D); 

(B)  Fourier  modulus  compensated 
for  telescope  MTF  (attempted 
division  by  zero  is  evident  for 
spatial  frequencies  above  the 
telescope  cut-off  frequency) ; 

(C)  clipping  of  the  low  spatial 
frequencies;  (D)  Fourier  modulus 
estimate  obtained  by  putting  back 
in  the  telescope  MTF. 

T.ie  average  separation  of  the  reconstructed  images  is  13.9  pixels  *0.27  arc-sec  and 
the  orientation  angle  is  42.7®  (47.3®  from  the  orthogonal  axis).  The  brightness  ratios, 
based  on  the  maximum  brightness  of  each  star  in  the  pair,  are  4.36,  4.17,  4.14,  and  3.96 
for  Figures  7(a)  through  (d),  respectively,  and  the  corresponding  magnitude  differences 
are  1.60,  1.55,  1.54,  and  1.49,  respectively.  Thus,  a  33%  Increase  in  threshold  level 
caused  only  a  6%  decrease  in  the  computed  brightness  ratio.  The  average  magnitude  differ¬ 
ence  of  the  reconstructions  of  Figures  7(a)  and  (b)  is  1.57. 

Conclusions 


Figure  7.  Reconstructed  images  of  SAO 
94163  (see  text) . 


We  have  discussed  steps  necessary  to  obtain  an  accurate  estimate  of  an  object's  Fourier 
modulus  from  the  raw  summed  power  spectrum:  detection  transfer  function  compensation, 
noise  bias  subtraction,  and  speckle  MTF  compensation.  For  compensation  of  the  speckle  MTF 
when  reference  star  data  is  not  available,  an  Improvement  over  the  Worden  subtract  method 
is  the  simple  method  of  clipping  the  Fourier  modulus  spike  at  the  very  low  spatial  frequen¬ 
cies  (for  objects  much  smaller  than  one  arc-sec  in  diameter),  ft  reconstruction  of  the 
binary  SftO  94163  using  this  method  resulted  in  an  Image  having  a  binary  separation  of  0.27 
arc-sec  at  an  angle  of  42.7®  (47.3®)  and  a  magnitude  difference  of  1.57  (brightness  ratio 
of  4.26).  The  reconstruction  of  a  binary  Is  trivial  and  does  not  require  the  use  of  the 
iterative  method;  however,  the  iterative  method  Is  required  for  complicated  objects,  and 
the  data  processing  steps  described  here  for  this  simple  example  can  be  used  in  more  gen¬ 
eral  circumstances.  Based  on  the  success  of  Image  reconstruction  experiments  using  summed 
power  spectra  of  complicated  two-dimensional  objects  computer-simulated  to  Include  the 
effects  of  atmospheric  turbulence  and  photon  noise^^,  it  Is  expected  that  It  will  be  pos¬ 
sible  to  reconstruct  fine-resolution  images  of  complicated  astronomical  objects. 
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Questions  are  raised  concerning  the  uniqueness  of  solutions  to  the  phase -retrieval  problem  for  functions  with  dis¬ 
connected  support.  A  counterexample  is  presented  showing  the  importance  of  considering  the  flipping  of  infinite 
proper  subsets  of  nonreal  zeros. 


INTRODUCTION 

In  this  Letter,  questions  are  raised  concerning  some  claims 
by  Greenaway*  amd  Bates.^  Their  papers  are  concerned  with 
the  question  of  uniqueness  of  solutions  to  the  phase-retrieval 
problem.  This  problem,  in  the  one-dimensional  case,  can  be 
stated  as  follows. 

Let  /  be  a  complex -valued  function  on  the  real  line  that 
vanishes  outside  some  finite  interval.  Let  F  be  its  Fourier 
transform.  Given  the  modulus  of  F  on  the  real  line,  i.e., 
|f(ti)|  for  all  real  u,  the  problem  is  to  reconstruct  the  original 
function  f  from  this  information.  The  general  uniqueness 
question  is;  How  marty  other  functions,  g  f,  exist  that 
vanish  outside  some  finite  interval  and  whose  Fourier 
transforms  satisfy  |G(u)|  =  |F(u)|  for  all  real  u? 


GREENAWAY’S  PAPER 

Greenaway*  considers  a  situation  in  which  the  unknown 
■  function  /  is  known  to  be  zero  outside  the  union  of  two  disjoint 
intervals  (a,c)  and  (d,b).  In  other  words, 

f  =  g-^h, 

where  g  is  zero  outside  (a,c)  and  h  is  zero  outside  (d,b)  (see 
Fig.  1). 

Now  let  F,  G,  and  H  be  the  Fourier  transforms  o{f,g,  and 
h,  respectively,  extended  by  analyticity  into  the  complex 
plane; 

F(w)  =  f(x)e~‘‘‘*dx, 

where  w  -  u  +  iv  and  u  and  v  ere  real.  The  modulus  of  Fon 
the  real  line,  i.e.,  lF(u)l,  is  given. 


I  o  c  d  b 


Fig.  1.  .Memlier  of  the  cl.iss  <if  fuiirliniis  with  disroniirrted  support. 
Note'  .All  hi  High  the  fumiinnsi'  nnd  h  are  represented  here  ns  in  wit  ive 
real  turn  timi'.  they  tan  he  rnniplex  i.diied 


The  question  is;  To  what  extent  do  the  conditions  de¬ 
scribed  above  determine  the  function  /? 

The  functions 

e’^/fx  +  (3)  and  e'^fl—x  0), 

where  the  overbar  denotes  complex  conjugation  and  a  and  d 
are  real,  have  the  same  Fourier  modulus  on  the  real  line  as 
does  /.  If  any  of  these  functions  are  also  zero  outside  the 
union  of  the  intervals  (a,c)  and  (d,b),  then  they  satisfy  all  the 
requirements  and  qualify  as  alternative  solutions.  These 
solutions  will  be  said  to  be  associated  with  the  solution  /. 

Now  the  revised  question  is:  Are  there  any  other  solutions 
not  associated  with  /? 

Let  Wo  be  a  nonreal  zero  of  F,  and  let 

F,(ia)  =  F(ui)^i^^^. 

W  —  Wo 

The  function  F]  can  be  viewed  as  being  gotten  from  F  by  first 
removing  a  zero  at  wo  and  then  adding  a  zero  at  Wa.  In  other 
words,  the  zero  at  Wo  has  been  “flipped"  about  the  real  line. 
Now  for  real  w,  w  =  u, 

u  -wq  _  ^ 
u  -  Wo 

and  therefore 

|F|(u)|  =  |F(u)|  for  all  real  u. 

Hofstetter®  and  Walther^  proved  that  if  fi  is  any  function  that 
vanishes  outside  some  finite  interval  and  lFi(ul|  =  lF(u)l  for 
all  real  u ,  then  Fi  is  gotten  from  F  by  flipping  various  sets  of 
nonreal  zeros  of  F  and  multiplying  by  a  constant  of  modulus 
1  and  by  an  exponential  function.  In  particular,  if  Fi  is  ob¬ 
tained  from  F  by  flipping  the  set  of  all  its  nonreal  zeros,  then 
its  inverse  transform  fi  satisfies 

fi(x)  =  f(-x), 

and  thus,  if  fi  vanishes  outside  the  union  of  (a,r)  and  (d,b), 
then  /i  is  a  solution  associated  with  /.  (Here,  if  a  zero  of  F  has 
multiplicity  n,  it  must  be  flipped  n  times.) 

Now  let  Z(F)  denote  the  set  of  all  nonreal  zeros  of  F. 
Greenaway  claims  that  if  F|  is  obtained  from  F  by  flipping 
any  proper  subset  S  of  Z(F)  (i.e.,  F  Z(F)1  and  if fi  vanishes 

outside  the  union  of  (o.r)  and  (d,b),  then  all  the  ixiints  in  .s' 
are  zeros  of  both  G  and  H 
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Fig.  2.  The  function  <t>(x ). 

Thus,  if  G  and  H  have  no  zeros  in  common  (which  would 
usually  be  the  case  if  g  and  h  are  gotten  more  or  less  randomly 
from  the  real  world),  it  would  follow  that  the  only  solutions 
are  f  and  its  associated  solutions.  Bates^  also  considers  this 
problem  from  another  point  of  view. 

Greenaway's  claim  is  true  in  the  special  case  in  which  F  has 
only  a  finite  number  of  nonreal  zeros.  (Actually,  Greenaway’s 
proof  holds  only  for  the  more  restricted  case  in  which  F  has 
a  Finite  number  of  nonreal  zeros  of  order  I.  However,  the  case 
of  higher-order  zeros  can  be  taken  care  of  by  an  extension  of 
his  argument.  See  Ref.  6.) 

The  following  counterexample  shows  that  Greenaway's 
claim  is  not  true  in  general.  In  this  counterexample,  the  set 
Z(F)  is  infinite  and  5  is  an  infinite  proper  subset  of  Z(F). 

Counterexample 

Let 


for|x|  <1 
for|x|  >  l' 


(See  Fig.  2.)  Then  the  Fourier  transform  <l>  of  0  is  given  by 


♦(U)) 


sin^(u)/2) 

(u)/2)* 


sinc^  (wf2). 


Note  that  <l>  has  no  nonreal  zeros. 
Now  let 


g(x) «  80(x) 


and 


/i)x)  =  20(x  -  4)  +  30(x  -  8)  +  20(x  -  12). 

Then  G  *  8<I>  has  no  nonreal  zeros,  and  hence  G  and  H  have 
no  nonreal  zeros  in  common.  Let 

f(x)  •g(x)  +  h{x) 

«  8</>(x)  +  20(x  -A)  +  30(x  -  8)  -t-  2<t>(x  -  12) 
and  let 

a  “  -1,  c  ■  1,  d  »  3,  6*13. 

(Sec  Fig.  3.)  Then  o<c<t/<6,  the  intervals  (a  ,c)  and  (d,b) 
are  disjoint,  and  f  is  zero  outside  the  union  of  (a,c)  and  (d,b). 
The  Fourier  transform  of  f  is 

F(u')  -  (8  +  2e-‘"'  +  36-*'“  + 

-  2(e-<'"  +  2)(e-»"‘  -  0.5e-<'"  +  2)<P(u  }.  (1) 

Now  let 

Ri(x)  4<Mx) 
and 

/ii(.r)  -  7^(x  -  4)  +  4</>(a  -  12). 


Then  Gi  »  4<1>  has  no  nonreal  zeros,  and  hence  G i  and  Hi  have 
no  nonreal  zeros  in  common.  Let 


/i(x)  *gi(x)  +  hi(x) 

-  4<filx)  +  70(x  -  4)  -(■  4<^(x  -  12). 

(See  Fig.  3.)  Then  /i  is  also  zero  outside  the  union  of  (o  ,c)  and 
(d,b).  The  Fourier  transform  of  /i  is 

F,(w)  ’‘(4  +  7e-<'“'  +  4e-‘2‘“')<I>(u)) 

■  2(2e-*“’  -1-  l)(e-®‘“'  -  0.5e~<'“’  +  2)^{w) 

=  2e-<^“'  (e<'“’  -1-  2)(e-*“'  -  0.5e-^“  +  2)i‘{w).  (2) 


It  follows  from  Elqs.  (1 )  and  (2)  that 

Now,  for  real  w,w  =  u. 


Therefore 


e*'“  +  2 

g-iiu  +  2 


1. 


|F|(u)|  •*  |F(u)|  for  all  real  u. 

Thus  /  and  fi  are  both  solutions,  and  it  is  clear  that  they  are 
not  associated. 

In  order  to  see  which  zeros  must  be  flippied  to  get  Fi  from 
F,let 

r,(u))  »e + 2 

and 

r2(a’)  »  e-**-'  -  0.5e-«'“  +  2. 

Then 


and 


F{w)  =  2r  i(u>)r2(u))<i>(u>) 


Fi(u;)  “  2e 


-■•'ll'  r.liTi 


(ui)r2(ui)<I>(U'). 


(3) 

(4) 


9  ■ 


h  it) 


Fig.  .3.  Funrlions  /(rl  (abovrl  and  /|(r)  (below)  have  ihe  same 
Fourier  modulus. 
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Fig.  4.  Above:  nonreal  zeros  of  F.  Below:  nonreal  zeros  of  Fi. 
The  circled  zeros  are  flipped. 


Since  has  no  nonreal  zeros  and  e"^'“  is  never  zero,  it  follows 
from  Elqs.  (3)  and  (4)  that 

Z{F)^ZiTi)uZiT2) 

and 

Z{Fi)^ZU7i  o  ZiTi), 

where 

ZiTi)  =  |u!:  a'6Z(ri)|. 

Thus  the  zeros  of  F  that  are  in  S  =  ZfPi)  are  flipped.  The  sets 
Z(T\)  and  ZfTj)  are  given  by 

Zd’i)  =  |— +  —  n  +  -  log  2:  n  =  0,  ±1,  ±2, . .  .1 
U  2  4  j 

and 

Z(r2)  =  |±^  tan"'  v/3T  +  ^  n  +  i  ^  log  2: 

n  =0.±l,±2,...|. 


(See  Fig.  4.)  The  flipping  of  the  zeros  in  S  is  followed  by 
multiplication  by  the  exponential  e"^‘“'.  The  latter  simply 
has  the  effect  of  translating  fi  into  the  proper  position. 

In  the  above  example  the  function  could  be  replaced  by 
any  function  that  is  zero  outside  the  interval  (—1,1)  and  whose 
Fourier  transform  has  no  nonreal  zeros.  For  example.  <<i  could 
be  replaced  by 

4)i(x)  =  (<>  •  <>)(2j), 


where  •  denotes  convolution,  or  by 


4>2(x)  “ 


for  |x|  <  1 
for|x|  >  r 


BATES’S  PAPER 

Bates^  considers  the  situation  in  which 


/(j) 


V 

n  •  1 


/„(x). 


where  each  f„  is  zero  outside  an  interval  /„  and  the  intervals 
I„,  n  =  I  . . .  N,  are  pairwise  disjoint.  He  claims  that  if  the 
Fourier  transforms  F„  have  no  nonreal  zeros  common  to  all 
of  them,  then  f  and  its  associated  solutions  are  the  only 
functions  with  compact  support  and  whose  Fourier  transforms 
have  the  same  moduli  as  that  of  the  Fourier  transform  of  f. 
Thus  Bates  claims  even  more  than  Greenaway  does.  There¬ 
fore  the  above  example  is  also  a  counterexample  to  Bates's 
claim. 

It  should  be  noted  that  the  function  f  to  which  Bates  is 
applying  this  argument  is  the  one-dimensional  projection  of 
a  function  defined  on  the  plane.  He  concludes  from  this 
argument  that,  in  the  two-dimensional  case,  solutions  with 
disconnected  support  are  almost  always  unique  (up  to  asso¬ 
ciated  solutions).  This  conclusion  regarding  two-dimensional 
uniqueness  may  well  be  true  for  other  reasons  discussed  by 
Bruck  and  Sodin.'^ 

COMMENTS 

We  stress  that  functions  with  disconnected  support  whose 
Fourier  transforms  have  only  a  finite  number  of  nonreal  zeros 
form  a  special  class  of  functions.  It  can  be  shown  (see  Ref.  6) 
that  such  functions  satisfy  certain  special  conditions.  Thus 
it  is  quite  common  for  functions  with  disconnected  support 
gotten  more  or  less  randomly  from  the  real  world  to  have 
Fourier  transforms  with  an  infinite  number  of  nonreal  zeros. 
The  fact  that  one  is  able,  in  practice,  to  compute  only  a  finite 
number  of  them  does  not  change  these  conclusions.  One 
cannot  claim  to  be  making  statements  about  the  uniqueness 
of  a  solution  unless  the  entire  infinity  of  nonreal  zeros  of  its 
Fourier  transform  is  properly  considered. 

Finally,  we  note  that  the  example  presented  here  does  not 
imply* that  the  attempt  to  reconstruct  functions  from  the 
moduli  of  their  Fourier  transforms  is  hopeless.  It  can  be 
shown®  that  a  stronger  separation  condition  on  the  separate 
parts  of  the  support  of  {  (i.e.,  that  the  parts  are  separated  by 
certain  greater  intervals  than  assumed  by  Greenaway  and 
Bates)  does  imply  that  f  and  its  associated  solutions  are  the 
only  solutions. 

This  research  was  supported  by  the  U.S.  Air  Force  Office 
of  Scientific  Research  under  contract  no.  F49fi20-80-r- 
0006. 
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A  representation  theorem  for  band-limited  functions  of  several  variables  whose  moduli  are  radial  is  stated.  The 
proof  of  this  theorem,  which  is  based  on  techniques  from  Fourier  analysis  and  the  theory  of  analytic  functions  of 
several  variables,  is  outlined  in  the  Appendix.  Several  uniqueness  results  concerning  the  phase-retrieval  problem 
that  follow  from  the  representation  theorem  are  derived.  References  that  relate  the  phase-retrieval  problem  to 
the  problem  of  wave-front  reconstruction  in  optics  and  image  reconstruction  in  speckle  interferometry  are  cited. 


INTRODUCTION 

The  phase-retrieval  problem,  which  consists  of  determining 
a  function  that  vanishes  outside  a  bounded  region  from  the 
modulus  of  its  Fourier  transform,  arises  in  several  areas  of 
physics  and  engineering,  e.g.,  wave-front  reconstruction  in 
optics  and  electron  microscopy  and  stellar  speckle  interfer¬ 
ometry. 

A  major  problem  that  arises  concerns  the  uniqueness  of 
solutions.  This  problem  has  been  extensively  studied  for  the 
one-dimensional  case  (see  Refs.  1-6)  by  using  the  theory  of 
analytic  functions  of  a  complex  variable.  The  main  result  of 
these  studies  indicates  that  it  is  not  possible,  in  general,  to 
reconstruct  a  function  of  one  variable  from  the  modulus  of  its 
Fourier  transform.  This  result  follows  from  the  fact  that  if 
/  is  a  (square-integrable  and  nonzero)  function  of  one  variable 
that  vanishes  outside  a  finite  interval  of  real  numbers,  then 
its  Fourier  transform  /,  considered  to  be  extended  to  a  func¬ 
tion  of  a  complex  variable,  admits  a  particular  factorization 
in  which  appears  the  infinite  set  of  complex  zeros  of  /. 
Flipping  any  of  the  nonreal  zeros  of  /  (that  is,  replacing  a  zero 
in  the  factorization  of  /with  its  complex  conjugate)  does  not 
change  the  modulus  of  /  or  the  interval  outside  which  /  van¬ 
ishes.  Since,  in  general,  an  infinite  subset  of  the  set  of  zeros 
of  /  will  be  nonreal,  this  zero-flipping  argument  shows  that, 
in  general,  the  one-dimensional  phase-retrieval  problem  ad¬ 
mits  of  an  infinite  number  of  distinct  solutions. 

Although  the  physical  situations  in  which  the  phase-re¬ 
trieval  problem  arises  involve  functions  of  two  variables,  the 
corresponding  uniqueness  problem  has  not  been  completely 
solved.  A  practical  iterative  technique  for  reconstructing  f 
from  the  modulus  of  its  Fourier  transform  for  the  stellar 
speckle  interferometry  problem  (in  this  problem  /  is  a  real¬ 
valued  nonnegative  function  representing  the  brightness 
distribution  of  a  star  or  other  space  object)  is  discussed  in  Refs. 
7-10.  Empirical  results  using  this  technique  and  the  theo¬ 
retical  discussions  in  Refs.  11-13  indicate  that  the  ambiguity 
for  the  phase-retrieval  problem  in  two  dimensions  (especially 
when  /  is  a  real-valued  nonnegative  function)  is  reduced. 
However,  specific  examples  of  nonuniqueness  for  the  two- 
dimensional  phase- retrieval  problem  can  easily  be  constructed 


as  shown  in  Ref.  12  and  also  by  the  following:  Let  g(x ,  y)  and 
h|(x,  y)  be  (nonzero)  real-valued  nonnegative  and  square- 
integrable  functions  that  vanish  outside  a  bounded  region  of 
the  plane  and  such  that  h2(x,>’)  ft  h|(-x,  -y)  for  all  (x,y). 
Then  define  /i(x,y)  and  ftlx.y)  by 


fk(x.y) 


g{u,o)hk{x  —u,y  —  v)dudv 


for  =  1,  2. 


Then  the  functions  ft  and  fz  are  distinct,  real-valued,  non¬ 
negative,  and  square  Integra ble  and  vanish  outside  a  bounded 
region  of  the  plane,  and  an  application  of  the  convolution 
theorem  for  Fourier  transforms  shows  tliat  the  moduli  of  their 
Fourier  transforms  are  identical. 

The  discussion  above  suggests  some  of  the  distinctive  and 
intriguing  aspects  of  the  uniqueness  question  for  the  two- 
dimensional  phase-retrieval  problem.  This  paper  treats  two 
uniqueness-related  questions  that  arise  in  the  followingspe- 
dal  case  of  the  phase-retrieval  problem;  Let  /  be  a  function 
(of  N  ^  1  variables)  such  that  /  is  square  integrable  and  van¬ 
ishes  outside  a  bounded  set  and  such  that  the  modulus  |/|  of 
its  Fourier  transform  is  radially  symmetric. 

Question  1.  What,  if  any,  symmetry  properties  concerning 
/can  be  deduced? 

Question  2.  Under  what  conditions  is  /  determined 
uniquely  from  |/|  ? 

The  answers  to  these  questions,  together  with  two  physical 
applications,  are  stated  below. 

Answer  I.  If  N  >  3,  then,  up  to  translation,  /  is  a  radial 
function.  If  N  ■  2,  then,  up  to  translation,  /  is  the  product 
of  a  radial  function  times  the  function  explikO),  where  i  » 
k  is  an  integer,  and  9  “  9(x,  y )  denotes  the  polar  angle 
of  a  planar  point  having  rectangular  coordinates  (x,  y).  If  N 
“  1,  no  symmetry  property  for  /  necessarily  holds. 

Application  I.  Answer  1  implies  that  if  the  real  (detected) 
impulse  response  function  of  a  thin  lens  (whose  thickness  and 
index  of  refraction  are  continuous)  is  radially  syTnmetric,  then 
its  aperture  function  (both  phase  and  apodization)  must  be 
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radially  symmetric  and  hence  its  coherent  transfer  function 
is  radially  symmetric. 

Answer  2.  For  S  arbitrary,  if  f  is  known  to  be  real,  then  { 
is  uniquely  determined  by  |/|.  Otherwise,  in  general,  there 
are  an  infinite  number  of  distinct  solutions. 

Application  2.  Answer  2  implies  that  the  brightness  dis¬ 
tribution  of  a  space  object  having  a  radially  symmetric 
brightness  function  f  (for  instance  a  star  with  no  significant 
sunspots)  can  be  uniquely  reconstructed  from  |/|  (where  |/| 
is  obtained  from  the  stellar  speckle  interferometry  technique 
of  Labeyrie)  hy  using  Fienup’s  iterative  technique. 

In  the  following  section  we  introduce  mathematical  notation 
that  is  used  to  state  a  representation  theorem  for  band-limited 
functions.  Since  the  derivation  of  this  theorem  is  quite 
elaborate,  it  is  placed  in  the  Appendix.  Then  three  conse¬ 
quent  results  that  are  stated  as  corollaries  are  derived. 

Answer  1  follows  directly  from  Corollary  1  and  the  example 
in  Eq.  (3),  whereas  Answer  2  follows  from  Corollaries  2  and 
3. 

NOTATION  AND  DERIVATIONS 

If  N  is  a  positive  integer  and  z  ■  (zi, . . .  -Zat),  u  *  (wi, . . .  ,wn) 
are  elements  of  C^,  where  C  is  the  complex  plane,  define  (z, 
w)  «  Denote  by  L2(R^)  the  set  of  complex  valued 

functions  on  /?•''  C  C  '‘  (R  is  the  real  line)  having  integrable 
squared  modulus,  and  for/  e  let 

A*)*  f  /Cy)exp(-2xi(x,y))dy  (1) 

Jr'' 

denote  the  Fourier  transform  of  /. 

We  define  a  function  F  e  LziR^')  to  be  band  limited  if  f' 
vanishes  identically  off  some  bounded  subset  of  R^.  Let 
R{R^')  denote  the  set  of  band  limited  functions  on  R^.  Note 
that  a  function  is  band  limited  if  and  only  if  it  is  the  Fourier 
transform  of  a  square  integrable  function  that  vanishes  out¬ 
side  a  bounded  region.  We  define  a  function  F  on  R^  to  be 
radial  if  F{x)  =  Ffy)  whenever  <x,  x)  «  (y,  y).  Let 
RAD(/?^)  denote  the  set  of  radial  functions  of  R^. 

Representation  Theorem 

If  JV  >  1  and  F  e  B(R^)  and  |f  |  e  RAD(/J'''),  then  F  can  be 
extended  to  an  analytic  function  F:  (s,  which  admits 

the  representation 

F(z)  -  P(z)exp(2xi  (xo,  z))  |l  -  ;~j’  (2) 

where  ' 

xo  e  R^, 

|X»|  are  an  infinite  sequence  of  nonzero  complex  num¬ 
bers, 

P(z)  is  a  polynomial  having  either  the  form  (a)  If  N  »  2, 
thenP(zi,z2)  *  A(zi  -biz  2)”"  (zi  -  iz  2)"”  for  some  integers 
mi.  m2  2  Oand  some  A  e  ^  or  (b)  If  Af  g  3,  then  P(z)  *  A((z, 
z )  I"  for  some  integer  m  g  0  and  some  A  e  C. 

I’ruof.  See  Appendix. 

The  representation  in  Eq.  (2)  above  is  interesting  because  it 
Is  not  nccessnrilv  true  in  the  one'dimensional  situation,  as  is 


demonstrated  by  the  following  example: 

Let  f:R  —  ^  be  defined  by 

'■’'W.-'*  (31 

10  otherwise 

and  let 

r,,  .  .  sin  t(x  —  1) 

Fix)"  fix)  ~ - -  (4) 

x(x  -  j) 

Then  the  analytic  continuation  Fix )  of  Fix)  admits  the  fac¬ 
torization 

where 


Clearly,  F  e  fi(fi)and  |F|  e  RAD(fi);  however,  the  roots  (i 
±  fc)  do  not  occur  in  negative  pairs,  as  Elq.  (2)  would  imply. 
This  difference  is  based  on  the  topological  property,  utilized 
in  the  Appendix,  that  the  sphere 

S^-'«(xe  fi^'|(x,x)  -1)  (6) 

is  connected  if  and  only  if  N  >  1. 

Corollary  I.  A  function  F  satisfies  the  hypothesis  of  the 
theorem  if  and  only  if  F  •  /,  where  f  is  the  translation,  by  xo 
e  R^,  of  a  function  h  e  LqIR'^)  and  that  has  the  form  ei¬ 
ther 

(a)  For  N  "  2,  hiy,,  yt)  •  exp(i(mi  -  mzlS)  g(yi,  y2), 
where  6  ■  arctan  (yz/yi)  and  g  is  radial, 

or 

(b)  For  N  i  3,  h  is  radial. 

Proof.  The  translation  part  follows  directly  from  the 
Fourier  inversion  theorem,  and  parts  (a)  and  (b)  follow  from 
the  facts:  (1)  the  Fourier  transform  satisfies  the  following 
property  with  respect  to  rotation;  If  Af  is  a  rotation  trans¬ 
formation  M:R^  -►  R^  and  f  e  L2iR^')  and  gix)  "  fiMx), 
then  AMy)  “^(y).and  (2)  forN  >  3,P(Afz)  “  P(z),andfor 
N  "  2,  P(Mz)  “  exp(i(mi  -  072)6]  P(z),  where  ^  is  the  angle 
of  rotation. 

Corollary  2.  For  N  >  \  the  following  tw,o  conditions  are 
equivalent: 

(1)  F,G  €  BiR^),  |F|  "  |fi  |,  |F|  is  radial,  ^and  G  are 
real. 

(2)  There  exists  a  real  radial  function  h  e  LzfP^)  such 
that  h  vanishes  off  some  bounded  subset  of  and  both  F 
and  G  are  translates  of  h. 

Proof.  Fact  (2)  implies  that  fact  (1)  follows  directly.  IfF 
and  G  are  real,  then  Fix)  "  F(-x)  and  Gix)  "  Gi-x)  for  every 
X  e  R^;  hence  in  the  representation  in  Corollary  (2)  for  F  and 
G,  the  polynomials  must  have  the  form  given  by  (b),  and  the 
roots  of  each  must  occur  in  the  conjugate  pairs.  Also,  since 
|F(x)|  •  |G(x)|  forallx  6  R^',  the  representations  must  be 
identical  except  for  the  exponential  terms  that  correspond  to 
translation. 

Corollary  3.  Let  F  satisfy  the  hypothesis  of  the  theorem 
and  let  X*  be  any  of  the  roots  in  Eq.  (2).  Then  the  function 
G,  defined  hy 
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G(i)  -  Fix) 


1  -  ix.x)/\k-^ 

1  -  (i.i)/X*2  ’ 


(7) 


also  satisfies  the  hypothesis  of  the  theorem.  Furthermore, 
|G(i)|  •  |F(i)|  for  every  I  e  R^. 

Proof.  The  equality  is  evident,  as  is  the  fact  that  |G|  e 
RAD(fi  and  G  e  L2(R^).  We  must  prove  that  (5  vanishes 
identically  off  a  bounded  subset  of  R^.  We  need  the  following 
result,  proved  in  Ref.  14. 


Result 

If  N  >  1  and  -•  (?,  then  F  €  if  and  only  if  F  e 

L2iR^)  and  F  extends  to  an  analytic  function  F:&'^  -'Cot 
exponential  type  (that  is,  there  exists  ,4  >  0  and  a  >  0  such 
that  I  FIs)  I  <i4exp(a|s|)  for  alls  e 

Now,  to  complete  the  proof  of  Corollary  3,  let  F(s)  and  G(s) 
be  the  analytic  continuations  of  Fix )  and  G  (x  >.  By  the  only 
if  part  above,  F(x)  e  and  F(s>  is  of  exponential  type. 

Hence  it  follows  from  Elq.  (7)  that  Gix)  e  and  that 

G(s)  is  of  exponential  type.  The  result  now  follows  from  the 
if  part  above. 


Fi,(w)  ={^|.A|exp(ia(p)w  +  i7)(p))  fj  (11) 

*-i\  hi 

where  L  =  Liu),  A  =  Aiu),  and  biv)  €  R  and  aiv)  €  R. 

Equation  (11)  implies  that  if  F,.(X)  =  0  then  Fi,(-X)  « 
F(-oX)  =  F_u(X)  =  0  for  X  e  <?.  Therefore,  after  a  suitable 
relabeling  of  indices,  we  may  write 

Fu(w)  =a/-|A|exp(ia(o)w  +  /6(o)]  f]  <^2) 

From  Eq.  (12)  it  follows  that  the  function  F(s);  C 

admits  the  factorization 

where  S(s)  satisfies  the  following  properties; 

(1)  S(s)  is  analytic  in  a  neighborhood  of  (0,0 . 0)  e 

C^, 

(2)  S(s)5(s)=^2(2,s)'-forse 

(3)  Sivt)  =  tMexp(io(o)t  +  i6(p)]  for  v  6  S^“',  t  €  R. 


APPENDIX  A: 

We  will  outline  a  proof  of  the  representation  theorem.  Define 
the  sphere  S^"'  *  (x  e  (x,  x  >  «  1).  Let  F  satisfy  the 
hypothesis  of  the  theorem,  and  for  any  u  6  S^~'  c  let 
F^it):  R  —C  be  defined  by 

Fu(t)  =  Fitv).  (9) 

It  can  be  demonstrated  that  F„  €  BiR).  Therefore  F„(t)  can 
be  extended  to  an  analytic  function  Fi,(w):  (?  -•  C  of  expo¬ 
nential  type. 

By  Hadamard’s  factorization  theorem,'®  Fuiu)  admits  the 
factorization 

Fu(w>  =  w^'^Ulolexplialolu]  H  [l  ~  .  I’  (10) 

*-i  I  X*(o)l 

where,  for  everj'  v  €  S^”',  Liu)  is  a  nonnegative  integer,  Aiv) 
6  C,  aiu)  €  R,  and  (XaIo))  are  the  infinite  set  of  zeros  of 
F,.(w). 

Furthermore,  the  set  (Ft.)  of  functions  forms  a  continuous 
family  of  functions  (in  the  topology  of  uniform  convergence 
on  compact  subsets)  parameterized  by  the  parameter  v  € 
S^~K  Hence,  by  RouchFs  theorem,'®  since  the  zeros  |X/i(i;)) 
form  a  discrete  subset  of  C  (they  have  no  limit  points),  the 
X*(ii)  are  continuous  functions  of  v  (after  a  suitable  permu¬ 
tation  of  indices). 

Now,  let  u  “  (1,0,0, . . .  ,0)  e  S'®'"'.  Since  |F|  is  radial, 
|F,,(0|  =  |Fu(t)|.  and  hence  F,.(«)F,.(«)  =  Fu («)/'u («)■ 
ThereforeL(o)  =  Liu),  jAiu)!  “  | /( (u )  |,  and  for  every  the 
function  X*(o):  S^”'  -*  C  is  continuous  and  S^~'  is  con¬ 
nected  (since  /V  >  1);  hence  the  set  (X*(ti):  V'  e  is 

connected.  However,  for  each  f  €  S'®’”'  it  follows  from  Elq. 
(10)  and  the  equality  |F,.(t)|  =  |Fu(f)|  that 

hit')  6  Xn(u)  u  |X*(i/)|. 

Since  the  latter  set  is  discrete,  its  only  connected  subsets  are 
single  points;  hence  each  function  Xn(i')  is  a  constant  X*. 
Hence  we  can  write 


We  will  prove  that  the  properties  above  imply  that  SU)  « 
Piz )exp(2Ti (xo,  2 ) )  for  some  xo  6  and  some  polynomial 

Piz)  having  the  form  given  in  either  part  (a)  or  part  (b)  of  the 
conclusion  of  the  theorem. 

First,  we  will  need  the  following  results; 

Lemma  I.  UN  •  2,  the  polynomial  iz,z)  factorizes  (over 
C)  into  irreducible  factors  as  iz,z)  “  zi*  +  22*  =  (21  +  i22)(2| 
- 122).  If  /V  2  3,  the  polynomial  (2,  z )  is  irreducible. 

Proof.  The  first  part  is  obvious.  If,  for  N  i  3,  (2, 2 )  is  not 
irreducible,  then  it  factors  as  the  product  of  two  linear  poly¬ 
nomials.  Hence,  there  exist  vectors  a,  b  e  R^  such  that  (z, 
2)  =  ((z,  a))((2,  6))  =  za'bz*  (where  *  denotes  transpose). 
Then  M  =  l/2(a.*6  -t  ba*)  is  a  matrix  having  rank  g2. 
However,  (2,  z)  =  zAfz  for  all  z  e  C  implies  that  M  is  the 
identity  matrix  that  has  rank  23.  The  contradiction  com¬ 
pletes  the  proof. 

Let  0  denote  the  set  of  functions  on  that  are  analUic  in 

some  neighborhood  of  (0,0 . 0)  e  C^.  An  element  /  e  0 

is  a  unit  if  (1//)  e  0.  Clearly,  /  is  a  unit  if  and  only  if 

/(0,0 . 0)  !>i  0.  An  element  /  e  0  is  reducible  over  0  is  it  can 

be  written  as/  =  g\g2,  where  neither,?!  nor?2  is  a  unit;  oth¬ 
erwise  it  is  irreducible. 

Lemma  2.  Every  nonunit  element  in  0  can  be  written  as 
a  finite  product  of  irreducible  factors,  and  such  a  product  is 
unique  up  to  the  order  of  its  factors  and  units. 

Proof.  This  is  a  statement  of  the  fact  that  0  is  a  unique 
factorization  domain.  A  proof  of  this  deep  result  is  given  in 
Ref.  17. 

Clearly,  not  every  irreducible  polynomial  is  irreducible  over 
0.  However,  the  following  particular  result  is  valid. 

Lemma3  The  polynomials  z  1  +  iz3and  (z.z)  (for  A'  5  3) 
are  irreducible  over  0. 

Proof.  If  /  and  g  are  nonunils  in  0,  then  their  Taylor  series 
expansions  have  no  constant  term  and  hence  fg  has  no  linear 
terms.  Therefore  fg  ^  zi  ±  iz'j.  ANo.  the  product  of  the 
linear  terms  of/ and?  yields  the  quadratic  term  of/?  that,  by 
Lemma  1,  ran  nex  er  equal  (z.z)  for  .V  ^  :t. 
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The  proof  of  the  theorem  is  concluded  as  follows.  From 
Properties  1  and  2  and  Lemmas  2  and  3,  it  follows  that  S(z) 
admits  the  factorization 


S(r)  »  P(z)H{z), 


(14) 
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where  P(z)  is  a  polynomial  having  the  form  in  part  (a)  of  the 
theorem  with  L  *  m  i  +  m2,  or  in  part  (b)  of  the  theorem  with 
L  *  2m,  and  H(z)  is  a  function  that  is  analytic  in  some 
neighborhood  of  the  origin  and  nonzero  at  the  origin.  Hence 
H(z)  admits  the  representation 

H(z)  =  expliC(z)l,  (15) 


where  C(z)  is  analytic  in  some  neighborhood  of  the  origin. 
Since  H{z)H{z)  =  1,  it  follows  that  C(z)  =  C(z);  hence  the 
coefficients  of  the  Taylor  expansion  for  G  are  real.  Com¬ 
bining  Property  3  and  Ek)s.  (14)  and  (15)  yields  either 
(a)  IfN«2. 

then 

G{vt)  =  a(u)t biv)  —  {mi  —  m2)f>,  (16) 

where 

6  =  arctan  (— ],  v  =  (I’l,  02)  e  R*. 


or 

(b)  lfN2  3. 
then 

H{vt)  ^  a(v)t  +  b{v).  (17) 


Examining  the  Taylor  series  expansion  for  G  shows  that  all 
terms  except  for  the  constant  and  linear  terms  vanish  and  that 
therefore  there  exist  iq  €  and  boe  R  such  that 


C(z)  ®  (*o.«)  (18) 

in  some  neighborhood  U  of  the  origin  in  C^'.  Therefore 

F(z)  =  P(z)exp(i (xo.  x )  +  it>o)  |l  " 
for  z  e  U. 

Since  F  is  entire  on  C^,  Eq.  (19)  holds  for  all  z  e  C^’.  The 
proof  is  concluded. 
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WB6.  Determining  the  Support  of  an  Object  from  the  Support 
of  Its  Autocorrelation  *  J.  R.  FIENUP  AND  T.  K  CRlMMINS,  Radar 
and  Optii!'  Diiisinn,  Enuironmental  Research  Institute  of  Michi/tan, 
P.O.  Box  StilS,  Ann  Arbor,  Michigan  48107. — In  astronomy,  x-ray 
crystallography,  and  other  disciplines,  one  olten  w  ishes  to  reconstruct 
an  object  distribution  trom  its  autocorrelation  or.  equivalently,  from 
the  modulus  of  its  Fourier  transform  (i.e.,  the  phase  retrieval  prob¬ 
lem).  It  is  also  useful  to  be  able  to  reconstruct  just  the  support  of  the 
object  (i.e..  the  region  on  which  it  is  nonzero).  In  some  cases,  for  ex¬ 
ample,  ui  find  the  relative  locations  of  a  number  of  pointlike  stars,  the 
object’s  support  is  the  desired  information.  In  addition,  once  the 
object's  support  is  known,  the  reconstruction  of  the  object  distribution 
by  the  iterative  method'  is  simplified.  We  show  several  methods  of 
finding  sets  which  contain  the  support  of  an  object,  based  on  the 
support  of  its  autocorrelation.  The  smaller  these  sets  are.  the  more 
information  they  give  about  the  support  of  the  object.  Particularly 
small  sets  containing  the  object’s  support  are  given  by  intersections 
of  its  autocorrelation’s  support  with  translates  of  its  auti.>correlation’s 
support.  It  will  be  shown  that  for  special  cases  this  gives  rise  to  a 
unique  reconst  rue  (on  of  the  support  of  the  object  from  the  support 
of  its  autocorrelation.  (13  min.) 

'  Work  support  by  AFOSR. 

‘ R.  Fienup.  Opt  Lett.  3.  27  ( 1978). 


Reprinted  from:  J.  Opt.  Soc.  Am.  70,  1581  (1980).  Presented  at 
the  1980  Annual  Meeting  of  the  Optical  Society  of  America, 
Chicago,  IL,  October  1980. 
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RECONSTRUCTION  OF  THE  SUPPORT  OF  AN  OBJECT 
FROM  THE  SUPPORT  OF  ITS  AUTOCORRELATION 

by 
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Environmental  Research  Institute  of  Michigan 
P.O.  Box  8618,  Ann  Arbor,  MI.  48107 

ABSTRACT 

The  phase  retrieval  problem  consists  of  the  reconstruction  of  an 
object  from  the  modulus  of  its  Fourier  transform,  or  equivalently 
from  its  autocorrelation.  This  paper  describes  a  number  of  results 
relating  to  the  reconstruction  of  the  support  of  an  object  from  the 
support  of  its  autocorrelation.  Methods  for  reconstructing  the  ob¬ 
ject's  support  are  given  for  objects  whose  support  is  convex  and  for 
certain  objects  consisting  of  collections  of  distinct  points.  The 
uniqueness  of  solutions  is  discussed.  In  addition,  for  the  objects 
consisting  of  collections  of  points,  a  simple  method  is  shown  for 
completely  reconstructing  the  object  functions. 
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M.P.  304,  P.O.  Box  5837,  Orlando,  Florida  32855 
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INTRODUCTION 

In  astronomy.  X-ray  crystallography  and  other  disciplines  one 
often  wishes  to  reconstruct  an  object  from  its  autocorrelation  or, 
equivalently,  from  the  modulus  of  its  Fourier  transform  {i.e.,  the 
phase  retrieval  problem)^.  It  is  also  useful  to  be  able  to  re¬ 
construct  just  the  support  of  the  object  {the  set  of  points  over 
which  it  is  nonzero).  In  some  cases,  for  example,  to  find  the  rela¬ 
tive  locations  of  a  collection  of  point-like  stars,  the  object's 
support  is  the  desired  information.  In  addition,  once  the  object's 
support  is  known,  the  reconstruction  of  the  object  by  the  iterative 

p 

method  is  simplified.  Therefore,  we  are  motivated  to  find  a  quick 
way  to  determine  the  support  of  the  object  from  the  support  of  its 
autocorrelation. 

In  the  general  case  there  may  be  many  solutions  for  the  object's 
support  given  the  autocorrelation  support.  In  what  follows  a  method 
for  generating  sets  containing  all  possible  solutions  is  given.  In 
addition,  for  the  special  case  of  convex  sets  a  method  for  generating 
a  family  of  support  solutions  is  described.  For  the  special  case  of 
objects  consisting  of  sets  of  discrete  points,  this  method  is  shown 
to  yield  a  unique  support  solution  unless  the  vector  separations  of 
the  points  in  the  object  satisfy  certain  redundancy-type  conditions. 
If  instead  of  manipulating  the  autocorrelation  support  one  uses  the 
autocorrelation  function,  then  for  the  same  objects  one  can  recon¬ 
struct  the  object  itself. 
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DEFINITIONS  AND  BACKGROUND 

The  results  shown  in  this  paper  apply  to  functions  on  Euclidean 
spaces  of  any  number  of  dimensions  except  where  otherwise  noted. 

For  simplicity  we  consider  only  objects  represented  by  real,  non- 
negative  functions,  f(x)  >  0,  where  xcE  (N-dimensional  Euclidian 
space).  The  su pport,  S,  of  a  nonnegative  function,  f(x),  is  the 
smallest  closed  set  such  that  the  integral  of  f(x)  over  the  comple- 
ment  of  S  in  E  is  zero.  (Roughly  speaking,  S  is  the  set  on  which 
f(x)  >0.)  In  this  paper  we  consider  only  functions  with  compact 
(i.e.,  closed  and  bounded)  support.  If  f(x)  is  a  finite  positive 
linear  combination  of  translates  of  the  delta  function,  then  S  is  a 
finite  set. 

We  will  be  making  use  of  linear  operations  on  sets.  Let  X  and  Y 
be  subsets  of  E^.  Then  the  addition  of  two  sets  and  multiplication 
by  scalars  is  defined  by 

aX  +  bY  5  {ax  +  by  :  xeX  and  yeY}  (la) 

where  a  and  b  are  real  numbers.  Similarly,  the  addition  of  a  point 
(which  can  be  though  of  as  a  vector)  xeE^  to  a  set  is  defined  by 

ax  +  bY  =  {ax  +  by  :  yeY}  (lb) 

where  again  a  and  b  are  real  numbers.  While  these  linear  operations 
on  sets  enjoy  some  of  the  properties  expected  from  addition  and  from 
multiplication  by  a  scalar,  other  properties  do  not  hold.  For  exam¬ 
ple  for  a  real  number  a  and  for  sets  X,  Y,  and  Z,  (X  +  Y)  +  Z  =  X  + 

(Y  +  Z);  a(bX)  -  (ab)X;  and  a{X  +  Y)  -  aX  +  aY.  However,  (a  +  b)X 
does  not  equal  aX  +  bX  except  for  special  cases.  The  role  of  zero  in 
this  case  is  played  by  the  set  {0)  consisting  of  the  single  point  0 
-  (0,  ...,  0)  e  e'^.  We  have  X  +  {0)  -  X  and  alOl-  {0).  But  X  -  X 
does  not  equal  {Q}  unless  X  consists  of  a  single  point. 


The  autocorrelation  of  f(x)  Is 
f^f(x)  . 


L  is 

/  f(y)  f(y  +  X)  dy 

(2a) 

E^ 

/  ^(y)  ^(y  -  x)  dy  . 

(2b) 

-N 


The  autocorrelation  of  f(x)  is  equal  to  the  inverse  Fourier  transform 
of  the  squared  modulus  of  the  Fourier  transform  of  f(x).  Note  that 
the  autocorrelation  is  (centro-)  symmetric:  f  ★■f(-x)  •  f  ★f(x). 

It  is  most  illuminating  to  interpret  Eq.  (2a)  as  a  weighted  sum  of 
translated  versions  of  f(x).  That  is,  in  the  integrand  of  Eq.  (2a), 
f(y)  acts  as  the  weighting  factor  for  f(y  +  x),  which  is  f(x)  trans¬ 
lated  by  -y.  If  the  support,  S,  of  a  nonnegative  integrable  func¬ 
tion,  f(x),  is  compact  and  if  A  is  the  support  of  its  autocorrelation 
function,  f^f(x),  then 


A  >  U  (S  -  y) 
yeS 


«  S  -  S  -  {x  -  y:  X,  y  c  S)  .  (3) 

The  proof  of  Eq.  (3)  is  in  Appendix  A.  Note  that  A  is  sytmietric; 

-A  =  A  (4) 

where  -A  «  {-x  :  x  c  A}.  In  addition, 

0  e  A  (5) 


as  long  as  S  is  non-empty.  To  illustrate  the  interpretation  of  an 
autocorrelation  support,  consider  the  case  of  the  two-dimensional 
support  S  shown  in  Figure  1(a),  having  the  form  of  a  triangle  with 
vertices  at  points  a,  b,  and  c.  The  autocorrelation  support  A  can 
be  thought  of  as  being  formed  by  successively  translating  S  so  that 
each  point  in  S  is  at  the  origin,  and  taking  the  union  of  all  these 
translates  of  S.  Figure  1(b)  shows  three  such  translates,  (S  -  a), 

(S  -  b),  and  (S  -  c).  The  rest  of  A  is  filled  in,  as  shown  in  Figure 
1(c),  by  including  all  (S  -  y)  such  that  y  c  S. 
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We  are  concerned  with  the  following  problem.  Given  a  symmetric 
set  A  C  find  sets  S  C  which  satisfy  A  =  S  -  S. 

Sets  S-i  and  S2,  which  are  subsets  of  are  equivalent, 

S,  -  $2  (6a) 

if  there  exists  a  vector  v  e  such  that 

$2  =  V  +  bS^  =  {v  +  Bx  :  X  e  S^}  (6b) 

where  b  =  +  1  or  -  1 .  From  £q.  3  it  is  easily  seen  that  if  $1  is 
a  solution  to  S  -  S  «  A,  and  if  $2  ~  S^,  then  $2  is  also  a  solution. 
If  S-j  is  a  solution  and  all  other  solutions  are  equivalent  to 
Sp  then  the  solution  is  said  to  be  unique  and  A  is  said  to  be 
unambiguous;  if  there  exist  any  nonequivalent  solutions,  then  the 
solution  is  nonunique  and  A  is  ambiguous.  For  example,  in  one 
dimension  the  set  of  points  A  «  (-1,  0,  1)  is  unambiguous,  having 
the  unique  solution  S  *  (0,  1);  whereas  the  set  of  points  A  *  {-3, 

-2,  -1,  0,  1,  2,  3}  is  ambiguous,  having  nonequivalent  solutions 
$1  s  (0,  1,3}  and  $2  *=  (0,  1,2,  3}, 

Not  all  symmetric  sets  that  contain  0  are  necessarily  autocorre¬ 
lation  supports,  as  the  following  example  shows.  As  shown  in  Figure 
2,  let  A  «  {(0,  0),  (1,  0),  (-1,  0),  (0,  1),  (0,  -1)}.  Because  of 
the  point  (1,  0),  a  solution  must  include  two  points  separated  by 
(1,  0).  Similarly  because  of  the  point  (0,  1),  a  solution  must  in¬ 
clude  two  points  separated  by  (0,  1).  Therefore,  the  solution  must 
have  at  least  three  distinct  noncolinear  points.  Of  the  three  pos¬ 
sible  pairings  of  the  three  points,  one  has  a  separation  along  (1, 

0),  a  second  has  a  separation  along  (0,  1),  and  the  third  pair  of 
points  must  have  a  separation  vector  which  is  not  on  the  horizontal 
or  vertical  axes.  However,  all  points  in  A  are  on  the  horizontal  or 
vertical  axes,  and  therefore  there  is  no  solution  for  A  »  S  -  S  in 
this  case. 
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A  set  X  is  convex  if  for  all  x,  y  c  X, 

tx  +  (1  -  t)y  c  X 


(7) 


for  all  t  e  [0,  1]  (that  is,  if  all  points  on  the  line  segment  be¬ 
tween  X  and  y  are  contained  in  X).  The  convex  hull  of  a  set  X,  de¬ 
noted  by  c.hull(X),  is  given  by  the  smallest  convex  subset  of 
containing  X.  Thus  X  is  convex  if  and  only  if  X  -  c.hull(X).  If  S 
is  convex,  then  A  «  S  -  S  is  also  convex.  More  generally, 

c.hull(X  -  X)  -  c.hull(X)  -  c.hull(X).  (8) 
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3 

LOCATOR  SETS 

In  many  cases  A  is  ambiguous,  and  so  it  would  be  useful  to  define 

a  set  that  contains  all  possible  solutions  to  the  equation  A  *  S  -  S. 
N 

A  set  LS  E  is  defined  as  a  locator  set  for  A  if  for  every  closed 
N 

set  S  C  E  satisfying  A  =  S  -  S,  some  translate  of  S  is  a  subset  of 
L,  i.e.,  there  exists  a  vector  v  such  that 

V  +  S  c  L.  (9) 

There  are  many  ways  to  generate  locator  sets.  For  example,  for 
ve  S,  S-vSS-SaA,  and  so  A  itself  is  a  locator  set.  The 
smaller  the  locator  set,  the  more  tightly  it  bounds  the  possible 
solutions,  and  the  more  informative  it  is.  Consequently,  we  wish  to 
find  locator  sets  that  are  as  small  as  possible.  A  smaller  locator 
set  than  A  is 

L  -  AO  H  (10) 

N 

v^ere  H  is  any  closed  half-space  of  E  with  the  origin  on  its 
boundary.  To  see  this,  choose  v  e  S  such  that  S  -  v  C  H.  Then  since 
S  -  V  c  A,  it  follows  that  S-vcL-AHh.  A  locator  set  that  is 
often  still  smaller  can  be  shown  to  be 

L-^P  (11) 

where  P  is  any  N-dimensional  parallelepiped  (in  two  dimensions: 
parallelogram)  containing  A. 

A  particularly  interesting  locator  set  is  given  by  the  following 
intersection  of  two  autocorrelation  supports.  If  w  e  A,  then 

L  -  AO  (w  +  A)  (12) 

is  a  locator  set  for  A.  Note  that  L  is  symmetric  about  the  point 
w/2.  The  proof  that  this  is  a  locator  set  is  as  follows.  Suppose  S 
satisfies  A  -  S  -  S.  Since  w  c  A,  there  exist  u,  v  c  S  such  that  w 
«  u  -  V.  Consider  z  c  S  -  v.  Then  z  «  s  -  v  where  scS,  z-s-v 
e  A,  and  z»s-u  +  (u-v)  «s-u  +  wcA  +  w.  Therefore,  z  e  A 
n  (A  +  w)  «  L  and  therefore  S  -  v  e  L. 


F-7 


r 


Naturally,  the  most  interesting  (smallest)  locator  sets  generated 
by  this  method  of  intersecting  two  autocorrelation  supports  are  ob¬ 
tained  by  choosing  w  to  be  on  the  boundary  of  A.  By  choosing  differ¬ 
ent  points  w  c  A,  a  whole  family  of  locator  sets  can  be  generated  by 
this  method. 


The  locator  set  defined  by  Eq.  (12)  can  be  shown  to  be  a  special 
/ 

case  of  the  following  more  general  locator  set.  Let  W  be  a  set  con¬ 
tained  in  some  S'  '•  S  for  every  set  S  satisfying  S  -  S  *  A.  That 
is,  W  is  an  intersection  of  translates  of  all  possible  solutions. 

Then*'''.- 


L  »  n  (w  +  A) 

WcW 


(13) 


is  a  locator  set  for  A.  The  proof  is  as  follows.  Suppose  that  S  -  S 
«  A  and  S  ~  S'  with  W  S  S.  Then  S'  -  p  c  A  for  every  p  e  S' .  Hence 

S'  e  n  (p  +  A)  c  n  (p  +  a).  (ia) 

peS'  pcW 

Although  Eq.  (13)  has  the  potential  for  producing  particularly 
small  locator  sets,  its  practicality  is  limited  by  the  fact  that  W 
is  defined  by  all  possible  solutions  to  S  -  S  »  A,  but  that  is  what 
is  assumed  to  be  unknown.  Nevertheless,  one  can  make  some  use  of 
Eq.  (13).  For  example,  if  S  -  S  »  A  and  w  e  A,  then  there  exists  a 
translate  S'  of  S  such  that  w  c  S'  and  0  e  S'.  Hence,  we  can  use  W 
=  (0,  w},  which  reduces  Eq.  (13)  to  Eq.  (12). 


Example  1. 

Consider  the  set  S  shown  in  Figure  3(a),  consisting  of  two  balls 
joined  by  two  thin  rods,  and  its  autocorrelation  support  A  ■  S  -  S 
shown  in  Figure  3(b).  An  example  of  a  locator  set  1/2  P  is  shown  in 
Figure  3(c);  it  is  very  suggestive  of  the  approximate  size  of  S,  but 
not  of  any  of  the  details  of  the  shape  of  S.  Figure  3(d)  shows  the 
generation  of  the  locator  set  L  ■  A  n(w  +  A)  for  a  particular 
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point  w  e  A.  Figure  3(e)  and  3(f)  show  two  other  members  of  this 
family  of  locator  sets  generated  with  two  other  points  w  e  A.  These 
locator  sets  generated  by  intersecting  two  autocorrelation  supports 
are  very  suggestive  of  the  shape  of  the  solution  (or  solutions). 

This  is  especially  true  if  one  realizes  that  any  solution  must  be 
contained  within  some  translate  of  each  of  these  locator  sets.  Un¬ 
fortunately,  for  the  general  case  it  is  difficult  to  narrow  down  the 
solution  any  further:  a  way  to  combine  the  information  from  two  or 
more  of  the  family  of  locator  sets  has  not  been  devised  for  the 
general  case.  However,  as  will  be  shown  in  the  sections  that  follow, 
for  special  classes  of  sets  much  more  can  be  done. 

Example  2 

Consider  the  set  A  consisting  of  a  circle  of  radius  1.  Figures 
4(a)-4(c)  show  the  locator  set  A  n  H,  1/2  P,  and  A  n(w  +  A),  re¬ 
spectively.  In  addition.  Figure  4(d)  shows  still  another  locator  set 

4 

for  A,  a  circle  of  radius  1//3,  which  is  due  to  Jung's  theorem  .  , 

The  areas  of  the  four  locator  sets  are  ir/2  =  1.571  for  the  half  , 
circle,  1.000  for  the  square  (parallelogram)  with  sides  of  length 
one,  2ir/3  -  /3l2  =  1.228  for  the  intersection  of  two  circles,  and 
ir/3  =  1  .047  for  the  circle  of  radius  1//T.  Consequently,  1/2  P  has 
the  smallest  area  of  the  locator  sets  considered  in  this  case.  In 
other  cases,  such  as  in  Example  1  above,  A  n  (w  +  A)  may  have  a 
smaller  area  than  1/2  P.  For  example,  the  locator  set  shown  in 
Figure  3(e)  has  a  smaller  area  than  the  locator  set  shown  in  Figure 
3(c).  Furthermore,  as  mentioned  earlier,  locators  of  the  form 
A  n (w  +  A)  tend  to  be  more  suggestive  of  the  shape  of  the  possible 
solutions. 
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4 

CONVEX  SETS 

A  number  of  interesting  results  hold  for  objects  having  convex 
support.  In  the  following,  statements  are  made  about  the  uniqueness 
of  convex  solutions  to  A  =  S  -  S  for  convex  sets  A,  and  methods  of 
detemining  solutions  are  given. 

All  convex  symmetric  sets  A  have  at  least  one  solution 

S  -  ^  A  =  {x/2  :  X  €  A}  (15) 

The  proof  is  as  follows.  Let  u,  v  c  1/2  A.  Then  2u  c  A,  2v  c  A  and 
-2v  e  A.  Therefore,  u  -  v  «  1 /2  (2u)  +  1/2  (-2v)  c  A,  since  A  is 
convex,  and  so  (1/2  A)  -  (1/2  A)  c  A.  Now  let  v  c  A.  Then  v/2  c 
1/2  A  and  -v/2  c  1/2  A.  Therefore  v  -  (v/2)  -  (-v/2)  c  (1/2  A)  - 
(1/2  A)  and  so  A  c  (1/2  A)  -  (1/2  A).  Therefore,  for  convex  A, 

A  -  (]a)  -  (]a)  (16) 

For  the  one-dimensional  convex  case  the  result  is  trivial:  the 
autocorrelation  support  A  is  just  a  line  segment,  and  a  unique  solu¬ 
tion  is  given  by  S  ■  1/2  A,  which  is  just  a  segment  of  the  line  half 
the  length  of  the  line  segment  A.  An  equivalent  result  for  the  one¬ 
dimensional  convex  case  is  the  solution 

S  =  A  n(w  +  A)  (17) 

where  w  is  on  the  boundary  of  A  (at  one  end  of  the  line  segment  A), 
or  in  symbols  w  e  3(A). 

4.1  AUTOCORRELATION  TRI-INTERSECTION  FOR  CONVEX  SETS 

For  the  two-dimensional  convex  case,  we  have  the  following  re- 

2 

suit.  Let  A  S  E  be  a  closed  convex  symmetric  set  (-A  ■  A)  with 
non-null  interior,  and  let 


Furthermore,  let 


B  -  A  n  (w^  +  A)  n  (W2  +  A).  (19) 

Then  B  is  a  solution  to  A  =  S  -  S,  that  is, 

A  «  B  -  B.  (20) 

The  proof  of  this  result  is  in  Appendix  B.  Since  Wi  can  be  any 
point  on  the  boundary  of  A,  Eq.  (19)  results  in  a  family  of 
solutions. 

Example  3 

Consider  the  set  S  shown  in  Figure  5(a),  which  is  the  convex  hull 
of  the  set  sho>wi  in  Figure  3(a).  Its  autocorrelation  support  A  = 

S  -  S  [which  is  the  convex  hull  of  Figure  3(b)]  is  shown  in  Figure 
5(b).  The  parallelogram  shown  in  Figure  3(c)  is  a  locator  set  for 
A.  A  member  of  the  family  of  locator  sets  A  n(w  +  A)  is  shown  by 
the  intersection  of  A  and  w  +  A  in  Figure  5(c).  A  member  of  the 
family  of  solutions  B  is  shown  by  the  intersection  of  the  three  sets 
A  n  (w^  +  A)n(w2  A)  in  Figure  5(d).  Two  other  examples  of  B  ob¬ 
tained  using  different  points  w-j  and  Wg  are  shown  in  Figures  5(e) 
and  5(f). 

4.2  THREE-DIMENSIONAL  INTERSECTIONS  OF  CONVEX  SETS 

For  convex  sets,  since  in  one  dimension  the  intersection  of  two 
sets,  Eq.  (17),  results  in  the  solution,  and  since  in  two  dimensions 
the  intersection  of  three  sets,  Eq.  (19),  results  in  solutions,  one 
might  hope  that  in  three  dimensions  the  set 

c  -  A  n  (w^  +  A)  n  (wg  +  A)  n  (W3  +  a)  (21 ) 

would  be  a  solution  to  S  -  S  «  A,  where  w^  c  9(A),  W2  c  9(A)  n  9(w^ 

+  A),  and  w^  e  9(A)  n  9(w^  +  A)  n  9(w2  +  A).  Unfortunately,  this  is 
generally  not  the  case. 
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A  counter-example  to  C  -  C  »  A  is  the  following.  Consider  S 
equal  to  a  sphere  of  diameter  one,  then  A  =  S  -  S  is  a  sphere  of 
radius  one  centered  at  the  origin.  Figures  6(a)  and  6(b)  show 
planar  cuts  through  the  centers  of  S  and  A,  respectively.  Figure 
6(c)  shows  a  planar  cut  through  A  n  (w-j  +  A)  H  (^2  *  through  the 
three  points,  0,  w^  and  A  O (w^  +  A)  n  (^2  * 
vertices,  one  in  front  of  the  plane  of  the  page  and  one  behind  the 
plane  of  the  page,  both  at  distance  one  from  the  centers  of  each  of 
the  three  intersecting  spheres.  Taking  the  intersection  of  this 
with  (w^  +  A),  which  is  centered  at  one  of  the  two  vertices,  gives 
us  C,  which  is  similar  to  a  regular  tetrahedron  (it  has  the  same 
vertices)  but  having  spherical  surfaces  of  radius  one  and  centers  at 
the  opposite  vertices  in  place  of  the  four  plane  faces  of  a  tetra¬ 
hedron.  Looking  for  a  moment  at  the  tetrahedron  T  having  the  same 
vertices  as  C  (i.e.,  the  convex  hull  of  points  0,  Wi,  w^,  and  w^ 
having  edges  of  length  one),  we  see  that  T  -  T  is  a  cuboctahedron, 
which  has  eight  triangular  faces  and  six  square  faces.  Since  T  Q  c, 
T  -  T  S  C  -  C.  The  surface  of  C  -  C  can  be  subdivided  into  fourteen 
patches  associated  with  the  fourteen  faces  of  the  cuboctahedron.  It 
can  be  shown  that  the  eight  patches  associated  with  the  triangular 
faces  coincide  exactly  with  the  surface  of  the  sphere  A  of  radius 
one.  However,  the  six  patches  corresponding  to  the  square  faces  do 
not.  For  example,  the  distance  from  the  origin  to  the  center  of 
each  of  those  six  patches  is  equal  to  the  distance  between  the 
centers  of  two  non-adjacent  edges  of  C.  This  distance  can  be  shown 
to  be  -  /2/2  =  1.0249.  That  is,  the  radius  of  C  -  C  is  greater 
than  that  of  the  sphere  A  by  about  2.49  percent  at  those  points. 
Hence,  C  -  C  ^  A. 
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4.3  LINEAR  COMBINATIONS  OF  CONVEX  SOLUTIONS 

Returning  to  the  N-dimensional  case,  if  S-j  and  S2  are  solu¬ 
tions  to  convex  A  =  S  -  S,  then 

S^  .  tS^  +  (1  -  t)S2  (22) 

is  also  a  solution  for  0  <.  t  £  1.  The  proof  of  this  result  is  as 
follows 

S^  -  S^  =  [tS^  +  (1  -  t)S2]  -  [tS,  +  (1  -  t)S2] 

=  tS^  -  tS^  +  (1  -  t)S2  -  (1  -  t)S2 

(23) 

»  tA  +  (1  -  t)A 
»  A 

since  A  is  convex. 

If  S^  is  a  solution,  then  so  is  -  S^.  Then  using  t  *  1/2 
and  S2  =  -  $1  in  Eq.  (22),  it  is  seen  that 

^1/2  *  7  ^1  '  7  ^1  *  7  ^ 

is  a  solution,  as  was  previously  shown  by  Eq.  (16). 

Eq.  (22)  can  easily  be  generalized  as  follows.  If  Sp  ..., 

are  solutions  for  convex  A,  and  if  t,,  ...,  t  >0  and  t,  + 

t,  +  ...  +  t„  »  1 ,  then 
2  n 

S  ■  I,  ‘i  Si  125) 

is  also  a  solution. 

In  the  two-dimensional  case,  if  B-j  and  62  are  solutions  obtained 
from  the  tri-intersection  method  of  Eq.  (19),  then  tB-j  +  (1  -  t)B2  is 
a  solution  which  usually  cannot  be  generated  by  the  tri-intersection 
method.  Thus,  new  solutions  can  be  obtained  by  this  method. 
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Example  4 

Consider  the  two-dimensional  convex  set  S  shown  in  Figure  6(a), 
consisting  of  a  circle  of  diameter  one.  A  =  S  -  S,  consisting  of  a 
circle  of  radius  one  is  shown  in  Figure  6(b),  and  a  tri-intersection 
solution  B  is  shown  as  the  intersection  of  three  circles  in  Figure. 
6(c).  This  solution  is  analogous  to  an  equilateral  triangle  but 
having  arcs  of  circles  of  radius  one  with  centers  at  the  opposite 
vertices  for  each  of  the  three  sides.  It  can  easily  be  seen  that 
all  other  solutions  B  generated  by  Eq.  (19)  are  similar  to  the  one 
shown  in  Figure  6(c)  except  rotated  in  the  plane.  The  circle  of 
diameter  one  shown  in  Figure  6(a)  is  not  of  this  form,  but  it  is  also 
a  solution  to  A.  As  shown  by  Eq.  (24),  S  =  1/2  A  in  Figure  6(a)  can 
be  generated  by  applying  Eq.  (22),  using  S.j  =  -  $2  =  B  and  t  =  1/2. 
One  of  a  family  of  additional  solutions  generated  by  Eq.  (22)  is 
shown  in  Figure  6(d).  It  was  generated  using  S.|  =  1/2  A  in  Figure 
6(a),  $2=8  in  Figure  6(c),  and  t  =  1/2. 


4.4  THE  AMBIGUITY  OF  CONVEX  SETS 

We  now  consider  the  question  of  uniqueness  of  convex  solutions 
of  A  *  S  -  S  for  convex  A.  As  mentioned  earlier,  S  =  1/2  A  is  a 
solution.  If  all  convex  solutions  are  equivalent  to  1/2  A,  then  A 
is  said  to  be  convex-unambiguous.  It  was  shown  that  in  two  dimen¬ 
sions  one  can  generate  a  family  of  solutions  by  Eq.  (19),  the  member 
of  the  family  being  determined  by  the  choice  of  w.| .  Eq.  (22)  or 
(25)  can  then  be  used  to  generate  still  nwre  solutions.  Therefore 
one  would  suppose  that  convex  sets  A  are  generally  convex-ambiguous. 
However,  it  is  also  possible  that  all  solutions  generated  by  Eq.  (19) 
are  equivalent,  in  which  case  A  would  be  convex-unambiguous. 

In  what  follows  it  is  shown  that  in  two  dimensions  if  A  is  a 
parallelogram  then  A  is  convex-unambiguous.  Let  A  be  a 
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paral lelogram  having  vertices  w-n  -  Wp  w^,  and  -w^.  By  Eq.  (12)  a 
locator  set  for  A  is  L  *  A  O  (w^  +  A)  since  w.|  c  A.  It  is  easily 

seen  that  L  *  1/2  Wi  +  1/2  A,  and  so  L'  *  1/2  A,  which  has  vertices 

1/2  Wp  -  1/2  Wp  1/2  w^,  -  1/2  Wg  is  a  locator  set  for  A.  Suppose 

A  =  S  -  S  where  S  is  convex.  Then  some  translate  of  S,  call  it  S', 

is  contained  in  L'.  Since  w^  e  A  there  exist  u,  v  c  S'  such  that 
w-j  =  u  -  V.  Since  S'  Q  L',  then  u,  v  c  L'.  It  follows  that  u  =  1/2 
w-j  and  V  »  -  1/2  Wp  Therefore,  1/2  w.|  e  S'  and  -  1/2  w-j  c  S'. 
Similarly  1/2  w^  c  S'  and  -  1/2  w^  e  S'.  Then,  since  S'  is  convex, 

L'  .  c.hull  Wp  -  2  Wp  2  ^2'  ~  2  '^2}]  “ 

Therefore,  S'  -  L'  =  1/2  A,  and  so  S  is  unique  among  convex 
solutions. 

It  can  also  be  shown  that  parallelograms  are  the  only 
two-dimensional  convex-unambiguous  sets,  and  convex  symmetric  sets  A 
Q  E  that  are  not  parallelograms  can  be  shown  to  have  infinitely 
many  nonequivalent  solutions  to  A  «  S  -  S.  The  lengthy  proof  of 
this  last  result  is  omitted  here  for  the  sake  of  brevity. 
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5 

AUTOCORRELATION  TRI-INTERSECTION  FOR  COLLECTIONS  OF  POINTS 

For  the  special  case  of  certain  finite  sets  consisting  of  a  col¬ 
lection  of  distinct  points,  the  solution  can  be  generated  by  a  method 
similar  to  the  one  for  convex  sets.  For  example,  the  function 

M 

fM  .  Y.  «(»  -  xj  (27) 

m«l 

consisting  of  M  delta  functions  having  amplitudes  f  >  0,  at  the 

N  ^ 

distinct  points  x^^j  e  E  ,  m  ■  1,  ...,  M,  would  have  support 

S  -  {Xjj,  :  m  -  1,  ...,  M}.  (28) 

Let  S  be  a  set  consisting  of  a  collection  of  distinct  points  and 
let  A  «  S  -  S.  Define  the  following  three  conditions  on  the  set  S, 
which  are  needed  for  the  results  that  follow. 

Condition  1; 

Whenever 

Xp  x^,  yp  y2,  Zp  e  S,  x^  ^  X2,  and 

xi  -  X2  +  yi  -  y2  +  Zl  -  Z2  -  0,  (29) 

then 

x^  -  or  Z2,  and  X2  -  y^  or  Zp 

Condition  2: 

Whenever  the  set  G  £  A  consists  of  three  distinct  points, and 
0  e  G,  and  G  -  G  Q  A,  then  G  is  equivalent  to  a  subset  of  S. 


Condition  3: 


Whenever  x^,  y^,  >2  e  S,  ^  and  x^  -  X2  =  y-j  -  y2; 

then  x-j  -  y-j . 

The  meaning  of  Conditions  1  and  3  are  discussed  in  Section  7. 

Condition  3  is  equivalent  to  saying  that  no  two  vector  spacings  be¬ 
tween  any  distinct  pairs  of  points  in  S  are  equal. 

Now  define  the  set  B  as  follows.  Let  w^  e  A  and  e  k  C\  (w-j 
+  A),  with  0  ^  w-j  ^  W2  ^  0,  and  let 

B  =.  A  n  (w^  +  A)  n  (w2  +  A).  Do ) 

We  have  the  following  three  results,  which  hold  for  any  number 
of  dimensions. 

1.  If  S  satisfies  Condition  1,  then 

S  ~  B  (31) 

That  is,  B  is  the  unique  solution  to  A  »  S  -  S. 

2.  If  S  satisfies  Condition  2,  then  S  is  equivalent  to  a  subset 
of  B. 

3.  If  S  satisfies  Conditions  2  and  3,  then  again  S  -•  B.  In 
fact,  S  satisfies  Conditions  2  and  3  if  and  only  if  it 
satisfies  Condition  1. 

The  proofs  of  these  three  results  are  in  Appendix  C.  Since  it 
requires  a  special  relationship  between  the  points  in  S  in  order  to 
violate  Condition  1,  it  is  probable  that  for  S  comprised  of  randomly 
located  points,  B  is  the  unique  solution  to  A  »  S  -  S.  More  will  be 
said  about  this  later. 

Example  5 

Consider  the  set  S  consisting  of  the  collection  of  9  points  shown 
in  Figure  7(a).  A  -  S  -  S  shown  in  Figure  7(b)  has  9^  -  9  +  1  ■ 

73  points.  Intersecting  A  with  a  translate  of  itself  using  Eq.  (12), 

3 


I' 
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a  number  of  different  locator  sets,  L,  for  A  can  be  formed,  two  of 
which  are  shown  in  Figures  7(c)  and  7{d).  (Each  locator  set  must 
contain  some  translate  of  any  solution  to  A  ■  S  -  S.)  Taking  the 
intersection  of  L  in  either  Figure  7(c)  or  7(d)  with  a  translate  of 
A  centered  on  any  point  within  L  yields,  according  to  Eq.  (29),  the 
solution  B,  which  is  found  to  be  equivalent  to  S  in  Figure  7(a). 

For  this  example,  for  all  allowable  values  of  w^  and  ^2  B  is 

found  to  be  equivalent  to  S,  which  is  shown  in  Figure  7(a);  that  is, 
the  solution  B  is  unique. 

Example  6 

Consider  the  set  S  consisting  of  the  collection  of  nine  points 
shown  in  Figure  8(a).  The  positions  of  eight  of  the  points  in  S  are 
identical  to  eight  of  the  points  of  the  set  shown  in  Figure  7(a). 

The  ninth  point  in  S  (in  the  lower  center)  was  moved  in  such  a  way 
as  to  make  equal  the  vector  spacing  between  two  pairs  of  four  dis¬ 
tinct  points.  That  is,  there  are  four  distinct  points,  Xp  X2,  y^, 
y2,  in  S  satisfying  x^  -  X2  *  y^  -  y2*  This  violates  Condition  3  and 
hence  also  Condition  1.  Therefore,  B  -  B  is  not  necessarily  equal 
to  A,  where  B  is  given  by  Eq.  (30).  A  *  S  -  S  shown  in  Figure  8(b) 
has  only  69  points,  compared  with  73  for  the  previous  example.  The 
redundancy  of  the  vector  spacings  (the  differences)  in  S  results  in 
a  two-fold  redundancy  in  four  of  the  points  of  A  (at  x^  -  X2  »  y^  -  y2, 

X2  -  Xi  »  y2  -  yp  ^1  -  yi  ■  ^2  ~  ^2’  ^1  "  ^1  *  ^2  ■  ^2^’ 

Figures  8(c),  8(d)  and  8(e)  show  three  of  the  locator  sets  for  A  that 

are  formed  using  Eq.  (12).  Once  again,  each  locator  set  must  contain 

some  translate  of  any  solution  of  A  -  S  -  S.  Therefore,  for  any 

solution  there  must  exist  a  point  v  such  that  a  translate  of  the 

solu-ion  is  a  subset  of  O  (v  +  L2)  where  and  L2  are  locator 

sets  for  A.  In  addition,  S  must  contain  at  least  9  points,  since  if 

2 

it  contained  only  8  points,  then  A  could  contain  at  most  8  -  8  1 

«  57  points.  Trying  all  possible  translations  of  the  locator  set 
shown  in  Figure  8(d),  only  two  of  its  intersections  with  the  locator 
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set  shown  in  Figure  8(c)  have  at  least  nine  points:  the  set 
shown  in  Figure  8(f),  and  a  translate  of  -  .  Any  solution  there¬ 
fore  must  be  equivalent  to  or  a  subset  of  is  found 

to  have  more  than  the  69  points  in  A,  and  A  C  “  ^1*  Therefore, 
since  has  ten  points  and  any  solution  must  have  at  least  nine 
points,  it  follows  that  any  solution  must  have  exactly  nine  points. 
Trying  other  pairs  of  locator  sets  for  A,  depending  on  the  pair  of 
locator  sets  chosen,  we  often  get  intersections  containing  ten 
points,  but  the  tenth  point  will  be  different,  such  as  in  the  set 
shown  in  Figure  8(g).  The  only  possible  solution  is  obviously  S  in 
Figure  8(a)  (that  is,  the  solution  is  unique),  since  it  is  the  only 
nine-point  set  that  is  equivalent  to  subsets  of  both  the  sets  shown 
in  Figures  8(f)  and  8(g).  Furthermore,  if  one  takes  intersections 
of  translates  of  the  two  particular  locator  sets  shown  in  Figures 
8(c)  and  8(e),  then  the  only  resulting  set  of  nine  or  more  points  is 
S  in  Figure  8(a);  that  is,  by  the  lucky  choice  of  which  two  locator 
sets  to  intersect,  the  solution  can  be  found  immediately. 

Equivalently,  it  can  be  shown  that  there  are  values  of  w^  and  W2 
such  that  B  ~  S,  although  that  is  not  true  for  most  values  of  w^  and  w 

Therefore,  even  when  Condition  1  is  not  satisfied  it  is  sometimes 
possible  to  find  solutions  (and  the  solution  may  even  be  unique  as 
it  was  in  Example  6)  by  intersecting  three  or  more  translates  of  A. 
However,  when  Condition  1  is  not  satisfied,  then  there  is  no  guaran¬ 
tee  that  the  solution  is  unique,  and  finding  solutions  is  consider¬ 
ably  more  complicated  than  simply  evaluating  B  by  Eq.  (30).  Unfor¬ 
tunately,  given  A  it  is  not  possible  to  immediately  determine  whether 
Condition  1  is  satisfied.  A  necessary  condition  that  Condition  1 

(or  Condition  3)  be  satisfied  is  that  the  number  of  points  in  A  can 
2 

be  expressed  as  M  -  M  +  1  where  M  1  is  an  integer. 
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RECONSTRUCTION  OF  OBJECTS  CONSISTING  OF  COLLECTIONS  OF  POINTS 

By  a  simple  modification  of  the  method  described  in  the  previous 
section  for  reconstructing  the  support  of  an  object  consisting  of  a 
collection  of  distinct  points,  it  is  often  possible  to  reconstruct 
the  object  itself.  The  method  is  analogous  to  using  Eq.  (30)  to  com¬ 
pute  B.  Recall  that  in  computing  B  one  takes  the  intersection  of 
three  translates  of  the  autocorrelation  support.  If  one  takes  the 
product  of  three  translates  of  the  autocorrelation  function  of  f(x), 
using  the  same  translations  as  used  to  compute  B,  then  the  support 
of  that  product  will  be  B.  And  if,  as  described  earlier.  Condition 
1  is  satisfied,  then  B  is  a  solution  to  A  *  S  -  S;  and  therefore  the 
support  of  that  product  is  equivalent  to  the  support  of  f(x).  In 
v^at  follows  it  is  shown  that  when  Condition  1  is  satisfied,  f(x) 
can  be  reconstructed  from  that  product  in  a  very  simple  way  [using 
Eqs.  (38)  to  (40)]. 

Suppose  that  the  object  is  given  by  Eq.  (27),  consisting  of  M 
delta  functions  located  at  the  distinct  points  x^^  having  amplitudes 
f^,  m  -  1,  2,  ...,  M.  The  positions  x^  are  vectors  in  any  number 
of  dimensions.  The  autocorrelation  is 

f^f(x)  -  /f(y)  f(y  +  x)  dy 

e'^ 

M  M 

■  f  S,  \  *  ’‘n> 

n*)  m«l 

which  can  be  expressed  as 

M  M 

f*f(x)  .  Y.  ‘M  *  Y  Z  Vm'l*  -  ’'m  *  *nl 
n>1  n>l  m^n 

o 

which  has  M*-  terms  located  at  positions  x  ■  x„  -  x„,  M  of  which  are 
at  X  -  0.  That  is,  it  has  up  to  -  M  +  1  distinct  terms.  For 
this  type  of  object,  the  fact  that  ttie  support  of  the  autocorrelation 
is  given  by  A  -  S  -  S  is  obvious  from  Eq.  (32). 


F-20 


At  this  point  we  would  like  to  take  the  product  of  two  such  auto¬ 
correlation  functions;  however,  the  product  of  two  delta  functions 
is  not  well  defined.  Several  approaches  to  overcome  this  difficulty 
are  possible.  For  simplicity  we  define  the  product  of  two  delta 
functions  as  follows: 


[a5(x  -  )][b6(x  -  X2)] 


'ab«(x  -  x^),  X2  *  x^ 

^  0  ,  X2  ^  x^ 


(33) 


It  is  intuitively  helpful  to  think  of  f(x)  in  Eq.  (27)  as  a  digitized 
array  of  sampled  values,  with  the  values  f^^  at  addresses  x^^,  m  *  1, 

2,  ...,  M.  Then  the  delta  functions  in  this  section  can  be  thought 
of  as  Kronecker  delta  functions. 


As  a  first  step  toward  forming  a  product  analogous  to  B  of  Eq. 
(30),  we  consider  the  product  of  f  ★f(x)  and  f  ★f(x  -  w),  where  w 
e  A,  and  we  choose  w  ^  0.  From  Eq.  (32)  it  is  evident  that  w  e  A  is 
of  the  form  Xj  -  Xj^  where  x^,  Xj^  c  S.  Therefore,  we  are  taking  the 
product  of  f  ★f(x)  and  f  ★f(x  -  Xj  ^i^).  where  Xj  -  Xj^  jt  0  lies 
within  the  support  of  f^f(x).  The  center  of  the  translated  auto¬ 
correlation  lies  within  the  support  of  the  untranslated  autocorre¬ 
lation.  Using  Eq.  (32b),  the  autocorrelation  product  is  (all  sum¬ 
mations  are  from  1  to  M  unless  otherwise  noted) 


APji^(x)  .  [f  ★f(x)][f  ★f(x  -  Xj  +  x,^)] 


*«k> 


*  E  E 


n'  m'^n'  ^  m  n  j  k 


(34a) 
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-SO', 


x^) 


*  f/k  -  V  *  “k' 

m^k,j 


(34b) 


v^ere  (O.T.)  denotes  "other  terms"  as  will  be  described  later.  [As 
an  example  of  how  Eq.  (34b)  follows  from  Eq.  (34a),  the  fourth  term 
in  Eq.  (34b)  arises  from  the  product  of  the  second  term  of  the  first 
autocorrelation  with  the  second  term  of  the  second  autocorrelation, 
with  m  ■  j,  n'  *  n,  and  m'  »  k.]  Using  Eq.  (32a),  another  way  of 
expressing  Eq.  (34)  is 


'''’jk<«>  -I  r  I  i:  Vmfn'V'f" 

n  m  n'  m' 

.«(X  -  x^.  +  x^.  -  X.  +  x^) 

from  which  it  is  seen  that  terms  survive  at  points  x  where 
^  -  ’^m  -  "n  “  ’'m'  -  ’^n*  \ 


(34c) 


(35) 


The  terms  shown  in  Eq.  (34b)  all  necessarily  appear.  In  addition, 
other  terms  may  appear,  as  indicated  by  "+  (O.T.)".  The  existence 
of  other  terms  depends  on  the  presence  of  special  relationships  be¬ 
tween  the  coordinates  x^^  allowing  Eq.  (35)  to  be  satisfied  "by 
chance".  There  being  no  additional  terms  is  equivalent  to  Condition 
1  (described  in  the  previous  section)  being  satisfied.  If  the  x^^^ 
were  independent  random  variables,  then  the  chance  of  having  addi¬ 
tional  surviving  terms  would  be  small,  and  we  would  have  (O.T.)  ■  0. 

Combining  Eq.  (27)  with  (34b),  the  autocorrelation  product  can 
be  expressed  as 
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*  -k)  *  "  »j)] 

1  Z  ^n  )  ^  ■  '‘j  ^ 

\(>A,j  / 

Therefore,  there  are  translates  of  the  supports  of  both  f(x)  and 
f(-x)  which  are  contained  within  the  support  of  APj|^(x).  This  can 
also  be  seen  from  the  fact  that  by  Eq.  (12)  the  support  of  APj.|^(x) 
is  a  locator  set. 

Now  consider  the  second  step  toward  forming  a  product  analogous 
to  B  of  Eq.  (30):  we  take  the  product  of  APj|^(x)  with  a  third  auto¬ 
correlation  f  ★f(x  -  w')  where  w'  is  within  the  support  of 
APji^(x).  Suppose  that  (O.T.)  .  0.  Then  from  Eq.  (34b)  it  is  seen 
that  the  support  of  APj|^(x)  consists  of  the  points  x^  -  Xj^  and  Xj  - 
Xp,  n  a  1 ,  2,  ...,  M.  We  first  take  the  case  where  w'  is  of  the  form 
Xn  -  X|^  and  treat  the  case  of  w'  »  x^.  -  x^  later.  Suppose  that  the 
specific  point  chosen  is  for  n  «  j'  ^  j,  k  (i.e.,  w'  *  Xj,  -  Xj^  ^  0 
and  w'  ^  w  a  Xj  -  Xj^).  Then  the  product  of  the  three  autocorre¬ 
lations  is 

APjkj-.k(’^)  *  [f^f(x  -  Xj  +  x^)]  [f^f(x  -  Xj,  +  x^)] 

'  [f^f(x  -  Xj.,  +  x,^)] 

■  [(?  *  (? 

to  ■ 

*  Z  -  'n  *  ’‘k<  *  Z  -  "J  * 

n^k.j  n^k.j 

.  -  V  V 

+  Z  Z  ’^n'  ■ ’^i'  " 

n-  m.^n'  "  m  n  j  x 

# 

(equation  continued) 


*  (Z'nVk*(’‘)  •  V'*  •  "j  *  "k)  *  -  ’'J'  * 


•  ’‘k)  *  fz  '^'’k'l’') 

\n,tk  / 


(37a) 


*  fz  *  ’‘k>  Yz '!)  fj'''  -  ’■o'  *  -s.)] 

\n^j  /  Wj*  /  (37b) 

That  is,  the  product  of  three  autocorrelations  has  the  same  support 
as  f(x  +  X|^),  as  was  shown  earlier  in  connection  with  Eq.  (30), 
since  B  is  just  the  support  of  the  product  of  three  such  autocorre¬ 
lation  functions.  Furthermore,  except  at  the  three  points  x  *  0, 

Xj  -  X|^,  and  x^,  -  Xj^  the  product  is  proportional  to  the  cube  of 
f(x  +  x,^). 

The  values  at  all  points  can  be  determined  as  follows:  First, 

0=  SfJ-f^f(O)  (38) 

n 

is  known,  so  that  factor  can  be  divided  out  from  the  last  three  terms 
of  Eq.  (37a).  Second,  let  the  coefficients  of  those  three  terms  in 
Eq.  (37a)  be  (with  J^f^  divided  out) 

A  > 

B  -  D"^ 

n 


^Pjkj-k(o)  -  i  ^j  ^j- 

(39a) 

^‘’jkj'k^^j  ■  ■  ^k  ^j  ^j* 

(39b) 

^^jkj'k^*j‘  *  *k^  ”  ^k  ^j 

(39c) 
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Solving,  we  get 


(40a) 

(40b) 

(40c) 


.  (ABC)’''’ 


(40d) 


The  remaining  values  of  for  m  ^  k,  j,  j'  can  then  be  computed 
by  dividing  Eq.  (37a)  by  fi,f,-fA,  and  then  taking  the  cube  root: 

K  J  J 


rwT" 


By  this  method  f(x)  is  reconstructed  exactly  to  within  a  translation, 
as  long  as  (O.T.)  «  0. 

In  performing  these  calculations,  had  we  chosen  a  second  trans¬ 
lation  of  the  form  (xj  -  Xj^,),  k'  ^  k,j  instead  of  (Xj,  -  X|^),  then 
the  result  would  have  been  similar,  except  a  translate  of  f(-x)  would 
have  been  reconstructed  instead  of  a  translate  of  f(x).  If  (O.T.)  ^ 
0,  that  is,  if  Condition  1  is  not  satisfied,  then  additional  terms 
appear  that  make  the  analysis  much  more  complicated  and  may  prevent 
the  reconstruction  of  f(x). 


Various  modifications  to  this  reconstruction  method  are  possible. 
For  example,  the  product  of  two  autocorrelation  products  AP..  (x) 

A 

APj,k(x)  is  proportional  to  f  (x  +  Xj^)  except  at  three  points. 

Another  example  is  to  define  the  autocorrelation  support  function  as 
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(41) 


M  _ 

A(x)  -  «(x)  +  t  I  «(x  -  x^  +  x^) 
n»l  m#n 

which  is  just  a  binary-valued  version  of  Eq.  (32b).  Then  the  product 
of  the  autocorrelation  function  with  two  properly  translated  auto¬ 
correlation  support  functions  is  proportional  to  a  translate  of  f(x), 
except  at  a  single  point  which  can  be  determined  by  a  few  extra  sim¬ 
ple  steps. 
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THE  OTHER  TERMS 


In  arriving  at  Eq.  (37),  it  was  assumed  that  the  other  terms 
(O.T.)  =  0,  or  equivalently  that  Condition  1  be  satisfied.  The  terms 
included  in  Eqs.  (34b)  and  (37)  are  those  that  necessarily  arise  by 
satisfying 


X  -  X  = 

m  n 


trivially,  for  example,  for  m  »  n,  m'  =  k,  and  n'  =  j.  The  other 
terms  are  those  that  satisfy  Eq.  (42)  by  chance,  that  is,  those  that 
arise  in  addition  to  those  that  (trivially)  arise  necessarily.  The 
trivial  solutions  are  the  ones  mentioned  in  connection  with  Condition 
1  in  Section  5.  These  other  terms  require  a  special  relationship 
between  the  points  in  S,  and  would  not  be  expected  to  occur  if  the 
points  in  S  are  randomly  distributed  in  some  region  of  E  . 


Figure  9  shows  in  two  dimensions  some  relationships  between 
points  in  S  that  would  cause  Eq.  (42)  to  be  satisfied  "by  chance", 
that  is  to  say,  in  a  nontrivial  way.  (S  may  contain  additional 
points  that  are  not  shown.)  For  example,  if  the  chords  between  three 
pairs  of  six  distinct  points  in  S  as  shown  in  Figure  9(a)  can  be 
translated  to  form  a  triangle  as  shown  in  Figure  9(b),  then  Eq.  (42) 
is  satisfied  nontrivially.  A  similar  result  holds  if  some  of  the 
endpoints  of  the  three  chords  are  the  same  points,  except  for  the 
trivial  case  of  three  points  already  in  the  form  of  a  triangle  de¬ 
fining  the  three  chords.  Note  that  when  m'  =  j  and  n'  =  k,  then  Eq. 

(42)  reduces  to  x„  -  x  *  2(x,.  -  x,  ).  That  is,  another  nontrivial 

case  is  the  existence  of  a  vector  separation  between  one  pair  of 
points  equaling  twice  the  vector  separation  between  another  pair  of 

points,  as  shown  in  Figure  9(c).  Also  note  that  when,  say,  m'  -  n', 

then  Eq.  (42)  reduces  to  x  -  x  =  x.  -  x,  .  That  is,  another 
nontrivial  case  is  the  existence  of  a  vector  separation  between  one 


pair  of  points  equaling  the  vector  separation  between  a  different 
pair  of  points,  as  shown  in  Figure  9((1).  This  last  case  is  also  a 
violation  of  Condition  3  [see  the  discussion  in  the  paragraph  fol¬ 
lowing  Eq.  (31)],  and  an  example  of  that  case  was  shown  in  Example  6 
depicted  in  Figure  8. 

As  mentioned  earlier,  if  the  points  x^^,  m  =  1,  2,  ...,  N,  are 
randomly  distributed,  then  it  would  be  unlikely  that  any  of  these 
special  relationships  exist,  and  so  one  would  expect  that  (O.T.) 
would  equal  zero  and  Condition  1  would  be  satisfied. 

These  results,  with  some  modifications,  can  also  be  extended  to 
the  case  of  an  object  having  support  on  a  number  of  disjoint  islands 
having  diameters  small  compared  with  their  separations  (as  opposed 
to  the  support  consisting  of  isolated  mathematical  points).  However, 
as  the  number  of  islands  increases  and  as  the  ratio  of  the  diameters 
of  the  islands  to  their  separations  increases,  .the  probability  of 
satisfying  a  condition  analogous  to  Condition  1  decreases. 
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8 

SUWARY 


We  have  described  a  number  of  new  results  relating  to  the  re¬ 
construction  of  the  support  of  an  object  function  from  the  support 
of  its  autocorrelation.  Locator  sets  that  contain  all  possible  solu¬ 
tions  were  described,  the  most  interesting  of  which  is  the  inter¬ 
section  of  the  autocorrelation  support  with  a  translate  of  itself. 

For  the  special  case  of  convex  sets  in  two  dimensions,  it  was  shown 
that  the  intersection  of  three  translates  of  the  autocorrelation  sup¬ 
port  is  a  solution.  These  solutions  can  usually  be  combined  to  form 
still  more  solutions.  Among  convex  sets  in  two  dimensions,  only 
parallelograms  have  unique  solutions.  For  the  case  of  objects  con¬ 
sisting  of  collections  of  distinct  points  it  was  shown  that  unless  a 
special  relationship  exists  between  the  points  in  the  object,  the 
intersection  of  three  translates  of  the  autocorrelation  support 
yields  the  solution,  and  it  is  unique.  If  instead  of  intersecting 
autocorrelation  supports  one  takes  the  product  of  translated  auto¬ 
correlation  functions,  then  for  the  same  objects  consisting  of  col¬ 
lections  of  points  one  can  easily  reconstruct  the  object  function 
itself. 

Some  of  the  results  on  objects  consisting  of  collections  of 
distinct  points  are  also  described  in  Reference  5.  These  results 
were  first  presented®  at  the  Annual  Meeting  of  the  Optical  Society 
of  America  in  Chicago,  October  1980. 
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APPENDIX  A 


If  the  support  of  f(x)  is  S,  then  the  support  of  f(y  +  x)  is 
S  -  y.  Then  the  interpretation  of  Eq.  (1)  as  the  weighted  sum  of 
f(y  +  x)  with  weights  f(y)  leads  intuitively  to  the  fact  that  A,  the 
support  of  f  tk-fCx),  is  given  by 

A  -  U  (S-y)-S-S. 
ycS 

A  rigorous  proof  for  compact  sets  S  is  as  follows. 

Define 

B(x,  r)  -  {y  e  E*'*  :  |x  -  y|  ^r), 
and 


^'^(x)  -  /  f(y)dy  . 

B(x,r) 

Then  X  e  S  if  and  only  if  f^(x)  >  0  for  all  r  >  0.  Also,  if 
|x  -  y|  <  r,  then  f2^(y)  ^  fp(x). 


Part  I;  S  -  S  C  A. 

Let  X,  yeS  and  let  r  >  0.  Then 


(f  ★f  )2r(^  -  y) 


/ 


f  ★  f(p)dp 


B(x-y,2r) 


J  J  f(w)f (w  +  p  +  X  -  y)dw  dp 

rN 


B(0,2r)  E 


f  f(w)  f2^(w  +  X  -  y)dw 


/  f(w  +  y)  f-  (w  +  x)dw 

>  /f(w  +  y)  f2^(w  +  x)dw 
B(0,r) 


>  J  f(w  +  y)dw  .  f^(x)  f^(y)  >  0, 

B(0,r) 

Therefore,  since  r  >  .0  was  arbitrary,  x  -  y  c  A,  and  so  S  -  S  C  a. 
Part  II:  A  C  S  -  S 

First,  note  that  since  S  is  compact,  S  -  S  is  a  closed  set.  Let 
X  c  A  and  let  r  >  0.  Then 

0  <  (f^f)^(x)  .  (f^f)^(-x) 

-  J  f*f(p)dp 
B(-x,r) 

~  f  f  ^(y  *  p  -  x)  dy  dp 

B(0,r) 

-  /  /  f(y  +  p)  f(y  +  X)  dy  dp 

B(0,r) 

-  /  f-(y)  fiy  *  X)  dy. 

eN 

Therefore,  there  exists  z  c  E***  such  that 

0  <  j  f^(y)  f(y  +  x)  dy  <  f2r(2)  /  ^y 

B(z,r)  B(z,r) 

-  f2r(2) 

Therefore,  f2r^^^  ^  ®  ^r^^  *  ^  snB(z,  2r)  ^  p  and 

snB(y,  r)  4  p,  where  y  -  z  +  x.  Choose  z'  t  sriB(z,  2r)  and  y'  e 

snB(y,  r).  Then  |y'  -  y|  <  r  and  (z*  -  z(  <  2r.  Then 

|(y'  -  z')  -  x|  -  |(y'  -  y)  -  (z'  -  z)| 

i  ly  -  yl  +  Iz'  -  zl  <  3r  . 

Since  r  ^0  was  arbitrary,  S  -  S  is  closed,  and  y'  -  z'  e  S  -  S,  it 

follows  that  x  c  S  -  S,  and  hence  A  £  S  -  S  and  hence  A  ■  S  -  S. 
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APPENDIX  B 

PROOF  THAT  A  -  B  -  B  FOR  CONVEX  SETS 

The  proof  that  A  »  B  -  B  will  be  dividei  into  two  parts.  First, 
it  will  be  shown  that  AC  6  -  B  and  then  it  will  be  shown  that 
A  3  B  -  B. 

Part  1;  A  £  B  -  B. 

It  can  be  shown  that,  because  A  has  non-null  interior,  the 
points,  0,  w.| ,  and  ^2  are  not  colinear.  Let 

B-j  «  c.hull  {0,  Wp  ^2)  . 

Since  B  is  the  intersection  of  three  convex  sets,  B  is  convex. 
Therefore,  B-j  £  B.  Let  A-j  •  B^  -  B^. 

Claim  1 ;  A^  C  a 

As  can  be  seen  from  Figure  Bl,  we  have 

Ai  ■  C.hull  {w^,  — *  ^2*  '*^1  “  '*^2*  '*^2  ~ 

Furthermore,  since  A  is  convex  and 

Wl*  “  '*2*  “  *^2'  '^1  ~  '^2’  '^2  “  '^1  '  ^  ’ 

it  follows  that  A^  £  A.  This  completes  the  proof  of  Claim  1. 

Now  let 

A2  -  A^  U  (B^  -  ^  +  W2  -  w^)  U  (B^  •*■«!-  W2) 

U(w^  +  W2  -  B^)  U(w^  -  W2  -  B^)  U  (W2  -  -  B^). 


See  Figure  B2. 


Claim  2;  A  C  A2* 

Let 

C  «  |w2  +  +  X2  (^2  -  Wi):  >  0  and  X2  >  0|. 

See  Figure  B3.  Note  that  C  is  an  open  set.  We  will  show  that  AOC 
»  d.  Suppose  not.  Then  let  p  c  A  H  C  and  let  D  -  c.hull  |p,  w-j, 

W2  -  See  Figure  B4.  Then  09  A  and  W2  c  int(O)  C  int(A). 

This  contradicts  the  assumption  that  W2  c  >(A).  Therefore,  AOC  •  #. 

Similar  arguments  apply  to  the  other  "notches"  in  A2.  There¬ 
fore,  A  9A2.  This  completes  the  proof  of  Claim  2. 

Summarizing,  we  have  B-j  9  B  and  A^9  A  9  A2. 

Let  p  c  A.  We  want  to  show  that  p  c  B  -  B.  We  have 

A  9  A2  -  [B^  u  (w^  '^2  ■  ^  -  '*1  -  “2^^ 

U [(B^  -  w^)  U (W2  -  w^  -  B^)]  U  [(w^  -  B^)  U  (B^  +  w^  -  W2)] 

U[(B^  -  W2)  U  (w^  -  '^2  ■  ^^'*2  -  ®i)  (®1  “2  ■  '^1^^* 

Case  1;  pc  B^  u  (w-j  +  W2  -  B^) 

We  have 

p  —  w-j  e  (B-j  —  w-j )  U  (W2  —  B'^ )  £  A'j  S  A. 

Therefore  p  e  w-j  +  A.  Also,  p  -  W2  c  (B^  -  W2)  U  (w^  -  B^)£  A^  £  A. 

Therefore  p  e  W2  +  A.  Since,  by  assumption,  p  c  A,  it  follows 

that  p  c  B.  We  also  have  0  c  B.  Therefore,  p-p-OcB-B. 

Case  2;  p  c  (-Bi)u  (B^  “  '*1  ~  ^*2^ 

We  have  -pc  B-jU  (w^  +  W2  -  B^).  Therefore,  by  Case  1,  -p  c 
B  -  B,  and  since  B  -  B  is  a  symmetric  set,  p  c  B  -  B. 


In  the  four  other  cases,  i.e.,  p  c(Bi  -  )  U  (w2  -  -  B-j), 

etc.,  we  find  that  p  e  B  -  B  by  similar  arguments.  This  completes 
the  proof  of  Part  1. 

Part  2:  A  2  B  -  8 

Let  s,  p  a  2»  3  lines  tangent  to  A  at  points  and 

Wi  -  ^2,  respectively.  See  Figure  B5.  Then,  since  A  is  symmetric, 
the  lines  -t2.  and  are  tangent  to  A  at  points  -w^,  -W2, 
and  W2  -  w-j.  respectively. 

It  follows  that  w^  -  a^  is  tangent  to  w-j  +  A  at  0.  Therefore, 
w-j  -a^  is  tangent  to  B  at  0.  See  Figure  B6.  Similarly,  W2  -a2  is 
tangent  to  W2  A  at  0.  Hence,  W2  -  is  tangent  to  B  at  0.  Also, 
^2  *  ^2  Tangent  to  W2  +  A  at  w^.  Therefore,  W2  +  ^2  i^  tangent 
to  B  at  w^ .  Finally,  w-j  -  a^  is  tangent  to  w-j  +  A  at  W2. 

Therefore,  w^  -  is  tangent  to  B  at  W2.  It  follows  that  B  lies 
between  lines  a^  and  w-j  -  a-j  and  between  a2  and  W2  -  ^2  and  between 

W2  +  ’^1  ~  ^3' 

Now  let  Cg  be  the  closed  positive  cone  spanned  by  w^  and  W2. 

That  is, 

Co  -{  wi  +  X2  W2:  >  0  and  X2  1  0}  . 

See  Figure  B7.  Let  C-j  be  the  closed  cone  spanned  by  -w^  and 
”2  ~  '*1  *  ^2  closed  positive  cone  spanned  by 

-W2  and  w^  -  W2.  Then 

e2  -  CgU  (-Cg)U  U(-Ci)  UC2  U(-C2). 

Next,  let 
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So-C„n,(B) 

S'!  •  (^1  C^j )  n  3(B) 

$2  -  (W2  *  Cg)  n  3(B) 

Then  3(B)  »  Sqij  y  $2*  See  Figure  B8. 

Now,  let  p^,  P2  c  B.  We  want  to  show  that  p^  -  P2  e  A. 

Case  1;  p^  -  P2  c  C2  . 

First,  if  p^  »  P2  then  p^  -  P2  ■  0  c  A. 

Now  assume  p^  ^  P2*  Let  i  be  the  (infinite)  line  passing  through 
p^  and  P2.  See  Figure  B9.  The  line  i  must  intersect  either  s^  or 

Subcase  1-a;  i  intersects  s^. 

See  Figure  B9.  Let  t'2  be  the  line  parallel  to  ^2  passing 
through  p^ .  Let  ^"2  be  the  line  parallel  to  *■2  and  passing  through 
P2- 

Now,  lines  1*2  ^”2  between  ^  and  W2  -  ^2*  Also,  0  is 

on  W2  -  ^2  *^2  ^2’  follows  that  and  i"2 

intersect  the  line  segment  [0,  W2].  Let  r^  be  the  point  at  which 
t'2  intersects  [0,  W2]  and  let  r2  be  the  point  at  which  ^"2  intersects 
[0,  W2].  Then,  since  0,  W2  e  B  and  B  is  convex,  r^  c  B  and  r2  c  B. 

Now  let  t'  be  the  line  parallel  to  t  and  passing  through  W2. 

Since  p^  -  P2  e  C2,  i  cannot  be  parallel  to  t2*  Therefore  i'  is  not 
parallel  to  t'2  or  t"2.  It  follows  that  i'  intersects  t'2  and  i"2* 

Let  be  the  point  of  intersection  of  i'  and  t'2  and  let  q2  be  the 
point  of  intersection  of  i'  and  i"2* 

R6 


Since  l  intersects  s^^,  it  follows  that  q-j  e  [p-j,  r^]  and  e 
[P2»  Since  ,  P2,  r^  r2  e  B  and  B  is  convex,  it  follows  that 

q-j  e  B  and  q2  e  6. 

Now,  since  p^  -  P2  e  C2,  we  must  have  q2  e  [q^,  W2].  That  is, 
there  exists  x,  0  <  x  <  1,  such  that  q2  »  x  q-j  +  (1  -  x)  W2  or 
q2  -  W2  -  x(q^  -  W2). 

Now  q^  e  B  c  W2  A.  Therefore,  q-j  -  W2  e  A.  Furthernx)re,  since 
0  <  1  -  X  <  1,  it  follows  that  (1  -  x)(qi  -  W2)  e  A. 

Finally,  we  have 

Pi  ■  P2  “  ‘’1  “  ‘’2 

-  (Pl  -  W2)  -  {q2  -  W2^ 

.  (1  -  x)(q^  -  W2)  e  A  . 

This  completes  the  proof  of  Subcase  1-a. 

Subcase  l~b;  i  intersects  s-j. 

See  Figure  BIO.  An  argument  similar  to  the  above  is  indicated 
in  Figure  BIO.  This  completes  the  proof  of  Case  1. 

Case  2:  p^  -  P2  e  -  C2  . 

We  have  P2  -  P]  «  ^2.  Therefore,  by  Case  1,  P2  -  p-j  e  A  and 
hence  p-j  -  P2  e  A. 

Similar  arguments  apply  when  p-j  -  P2  e  Cq,  -  C^,  C^  and  -C^. 

This  completes  Part  2  of  the  proof  that  A  ■  B  -  B  for  convex  sets. 
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APPENDIX  C 

PROOF  THAT  B  -  S  FOR  CERTAIN  POINT-LIKE  SETS 
In  this  appendix  we  take  the  approach  of  first  proving 
Theorem  1 . 

If  S  satisfies  Condition  2,  then  S  Is  equivalent  to  a  subset  of 
B. 

Then  we  prove 
Theorem  2. 

If  S  satisfies  Conditions  2  and  3,  then  S  Is  equivalent  to  B. 
Finally  we  prove 
Theorem  3. 

Condition  1  is  satisfied  If  and  only  If  both  Conditions  2  and  3 
are  satisfied. 

It  then  follows  from  Theorems  2  and  3  that  If  S  satisfies  Con¬ 
dition  1  then  S  Is  equivalent  to  B. 

Proof  of  Theorem  1 

Let  G  -  {O,  w^,  Wg}.  We  have  0,  w^  Wg  e  A.  Since  A  Is  sym¬ 
metric,  we  also  have  -w-j,  -W2  c  A.  Furthermore,  since  w^  e  w^  +  A, 

we  have  W2  -  w^  e  A  and  w^  -  W2  e  A.  Therefore, 

G  -  G  -{0,  w^,  -  w^,  W2,  -  W2,  w^  -  W2.  W2  -  w^icA 

Also,  G  has  three  members.  Therefore,  by  Condition  2,  there  exists 
V  e  E^  such  that  v  +  B  G  C  s  where  B  -  *  1.  It  follows  that  GC 
b(S  -  v).  Let  S^  -  b(S  -  v).  Then  GC  s^.  Also,  S  -  Sp  and  there¬ 
fore,  A  -  S^  -  S^.  We  want  to  show  that  S^C  B. 

Let  X  e  S^.  We  have  0  e  G  C  S^.  Therefore 

X  -  X  -  0  e  S^  -  Si  -  A.  (C-1) 
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Also,  w-j  e  G  CS-j.  Therefore,  x  -  w-]  c  $1  -  S-j  »  A,  and 

X  e  w-j  +  A.  {C-2) 


Similarly, 


X  e  W2  +  A. 


{C-3) 


From  (C-l),  (C-2),  and  {C-3),  it  follows  that  x  c  B.  Since  x  was  an 
arbitrary  element  of  S-j,  it  follows  that  S-^Q  B. 


Proof  of  Theorem  2 

Let  V,  0  and  S-j  be  defined  as  in  the  proof  of  Theorem  1.  Then, 

by  that  same  proof,  S-j  Q  B.  We  want  to  show  that  B  £  S-j.  Let  x 

e  B.  We  want  to  show  that  x  e  S^. 

First,  suppose  that  x  e  6.  Then,  since  6  £  S^  x  c  S^  and 
we  are  done.  Now  assume  x  G.  Let 

6^  *  |0,  X,  w.| ,  i  M  1 ,  2  . 

Since  x  c  B,  it  follows  that  x  e  A  and  -  x  c  A.  We  also  have  w^  c 
A  and  -w^  c  A.  Furthermore,  since  x  e  w^  +  A,  x  -  w^  c  A  and  w^  -  x 
e  A.  Therefore, 

-|0,  X,  -X,  w.,  -  w^,  X  -  w.,  w.  -  x{c  A. 

Also,  G^  has  three  members.  Therefore  by  Condition  2,  there  exists 
e  such  that  v.  +  0.  G^  £  S  where  0-  »  ±  1. 

Claim;  There  exists  j  e  {l,  2  [such  that  0j  «  0. 

Suppose  not.  Then  0.j  .  Pg  *  -0.  We  have 

|v,  v  +  0WpV  +  0W2(-v  +  0G£S 


and 

{v^,  v^  +  0^x,  v^.  +  0^.  w^  }  -  v^  +  0^  G^  £  S,  i  .  1,  2. 
Also,  since  0^  -  02  -  -0, 
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(v  +  B  W^)  -  V  -  -  (v^  +  W.),  i  «  1,  2. 

Now,  applying  Condition  3  with 

-  V-+  B  w.,  X2  -  V,  -  V.,  y^  »  V.  +  B^  w., 

we  conclude  that 

.  V  +  B  w^,  i  .  1,  2.  (C-4) 

Furthermore, 

-  (v^  +  B^x)  -  V2  -  (V2  +  B2X). 


It  therefore  follows  from  Condition  3  that 


(C-5) 


From  (C-4)  and  (C-5)  it  follows  that  w^  «  contradicts 
the  assumption  that  w^  ^  W2  and  proves  the  Claim  that  there  ex¬ 
ists  4  c  1 1,  2 1  such  that  Bj  =8. 

Now,  we  have 


(Vj  +  Bj  Wj.)  -  Vj  «  (v  +  B  Wj)  -  V. 

By  Condition  3,  v.  ■  v.  Therefore, 

V 

V  +  B  X  -  V j  +  Bj  X  e  Vj.  +  B j  Gj  S  S 


and 


X  c  b(S  -  v)  -  S^. 


Proof  of  Theorem  3 
Part  I:  1  -*  2. 

Let  G  have  three  elements,  0  c  G  S  A,  and  G  -  G  c  A.  We  want  to 
show  that  G  is  equivalent  to  a  subset  of  S-.  That  is  we  want  to  show 
that  there  exists  v  c  such  that  v  +  b  G  e  S  where  b  -  *  1. 

Let  G  -  jo,  Qp  92 1 .  Then  0  ^  g^  ^  92  ^  0  and 
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{ 0»  9if  "Q]*  92*  “92*  ~  ^2’  ^2  “  S]  }  “  ^  -  G  CA. 

Since  A  =■  S  -  S,  there  exists  x^,  X2,  y^,  y2.  z-j,  Z2,  e  S  such  that 
=  x^  -  x^,  92  -  9i  -  yi  -  y2»  ^'’‘1  -  92  =  -  Z2.  Then  x^  ^ 

and 

-  X2  +  -  y2  +  Zi  -  Z2  =  0.  (C-6) 

Therefore,  by  Condition  1, 

x^  =  y2  or  Z2,  and  x^  =  y^  or  z^. 

Claim  1;  Cannot  have  x^  =  y^  and  x^  *  y^ • 

If  this  were  the  case,  then  it  would  follow  from  (C-6)  that  z-j 

-  Z2  «  0.  But  z^  -  Z2  =  -  92  ^  0-  This  proves  Claim  1. 

Claim  2:  Cannot  have  x^  *  Z2  and  X2  =  . 

If  this  were  the  case,  then  it  would  follow  from  (C-6)  that  y^ 

-  y2  =  0.  But  y^  -  y2  *  92  -  9]  ^  proves  Claim  2. 

Therefore  either  x^  «  y2  and  X2  =  z^,  or  x^  »  Z2  and  X2  *  y^ . 

Case  1;  x^  -  y2  and  X2  *  z^ . 

We  have 

i'l  -  X2  -  -  X2  +  y^  y2  =  9]  92  -  9i  =  92 

and  x^  -  X2  *  .  Rewriting  these  equations,  we  have  Xg  +  92  =  y-; 

and  X2  +  g-]  »  x^ .  Therefore, 

X2  +  G  -  {X2.  X2  +  g^.  X2  +  92}  -  {X2.  xp  y^}  CS. 

Letting  v  «  X2  and  b  ■  1,  we  have  v  +  B  S. 
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Case  2;  x-j  =  Z2  and  X2  »  y-j . 

Vie  have 

Zi  -  X2  .  Xi  -  X2  +  2i  -  22  ■*  9i  -  32 

and  x^  -  X2  *  g-j .  This  yields  z-j  »  X2  +  -  g2  and  x^  »  X2 

gi .  Therefore, 

X2  +  g^  -  G  .{X2  +  9i.  X2.  X2  +  g^  -  g2}  -  {xp  X2.  z^fcs. 

Letting  v  «  X2  +  g-j  and  e  «  -1,  we  have  v  +  b  G  SS. 

Part  II:  1  3. 

Let  Up  U2.  Vp  V2  e  S,  with  u-j  ^  U2  and  u^  -  U2  =  v^  - 

want  to  show  that  »  Vp  .  Let 

X]  -  V2,  X2  -  vp  y^  -  Vp  y2  *  1^2’  *  “r  ^2  “ ''r 

Then  x^  i  ^2  and 

X]  -  X2  +  y^  -  y2  +  2]  -  Z2  *  0  • 

Therefore,  by  Condition  1,  x-j  «  y2  or  Z2,  and  X2  «  y-j  or  Zp  Now 

Z2  -  X]  -  vi  -  V2  .  -  U2  ,1  0. 

Therefore,  x^  ^  Z2  and,  hence,  x^  «  y2,  or  V2  »  U2.  Since,  by 

assumption,  u-j  -  U2  »  v-j  -  it  follows  that  u-j  »  Vp  This 
completes  Part  II  of  the  proof. 

Part  III;  2  and  3  1 

Let  Xp  X2,  yp  y2»  Zp  Z2  e  S  with  x-j  ^  X2  and 

Xi  -  X2  +  y^  -  y2  Z]  -  ^2  * 

We  want  to  show  that  x-j  «  y2  or  Z2,  and  X2  -  y-]  or  Zp 
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Case  1:  »  y2* 

In  this  case,  it  follows  from  {C-8)  that 

Xi  -  X2  =  Z2  -  Zi  .  (C-9) 

Therefore,  by  Condition  3,  x.|  =  Z2  and  it  follows  from  (C-9)  that 

X2  =  z^. 

Case  2:  =  Z2. 

Here,  it  follows  from  (C-8)  that 

Xi  -  X2  =  y2  -  y^.  (C-10) 

Then  by  Condition  3,  x^  =  and  it  follows  from  (C-10)  that  X2  « 

Case  3:  y^  ^  y2  and  z^  ^  Z2. 

Let  .  x^  -  X2  and  92  .  y2  -  y^  and  let  G  «  |0,  ,  92}. 

Now,  gp  92  c  S  -  S  «  A  and  hence,  also  -  g^  -92  e  A.  Furthermore, 

91  -  92  *  x^  -  X2  +  y^  -  y2  =  Z2  -  z^  e  S  -  S  =  A 

and  hence,  also  92  -  g-j  e  A.  Thus  we  have  0  e  GC  a  and  G  -  G  ^  A. 

We  also  have  *  x^  -  X2  ^  0,  92  “  ^2  ”  ■^l  ^  9^  -  92  = 

Z2  -  z^  ^  0.  Therefore,  G  has  three  members.  Hence  by  Condition  2, 
there  exists  v  e  such  that  v  +  b  G  £  S,  where  b  »  *  1. 

Subcase  3-a:  b  »  1 . 

We  have 

{v,  V  +  gp  V  +  92!  -  V  +  B  G  C  S.  (C-11) 
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Now,  (v  +  Qi )  -  V  »  Qi  »  Xi  -  X2-  Therefore,  by  Condition  3,  v  « 
x^,  and  hence,  by  (C-ll),  *2  e  S  and  X2  +  92  c  S. 

Now  (X2  +  92)  -  X2  «  92  *  y2  -  •  Therefore,  by  Condition  3,  X2  +  92 

a  y2  and  X2  *  y^*  Also 

(X2  +  9^)  -  (X2  +  92)  -  9i  -  92  =  ^2  -  2^  . 

Therefore,  by  Condition  3,  X2  +  g^  *  22*  But  X2  +  g^  *  x-j . 

Therefore  Xi  =22* 


Subcase  3-b;  6  *  -1 

We  have 

{v,  V  -  gp  V  -  92}  »  V  +  e  G  S  S  .  (C-12) 

Now,  V  -  (v  -  9^ )  *  g-j  »  -  X2.  Therefore,  by  Condition  3,  v  * 

x^ ,  and  hence,  by  (C-12),  x^  -  g^  e  S  and  x^  -  92  c  S.  Now,  x^  - 
(x^  -  92)  a  92  -  y2  -  y^  Therefore,  by  Condition  3,  x^  -  y2.  Also, 

(x-j  -  91 )  ~  (xi  -  92)  *  92  “  9]  *  ^1  ”  ^2’ 

Therefore,  by  Condition  3,  x^  -  ’^l  "  ^1  *  ’^2* 

Therefore,  X2  a  z-^. 
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FIGURE  CAPTIONS 

1.  (a)  Set  S;  (b)  three  of  the  translates  of  S  that  make  up  A; 

(c)  autocorrelation  support  A  -  S  -  S. 

2.  A  symmetric  set  that  is  not  an  autocorrelation  support. 

3.  Locator  Sets,  (a)  Set  S;  (b)  A  «  S  -  S;  (c)  locator  set  L  = 

1/2  P;  (d)  formation  of  L  »  A  O  (w  +  A);  (e)  and  (f)  two  other 
members  of  the  family  of  locator  sets. 

4.  Locator  sets  (shaded  areas)  for  the  circle,  (a)  AHH;  (b)  the 
square  1/2  P;  (c)  A  H  (w  +  A),  (d)  circle  of  radius  1/vT. 

5.  Autocorrelation  tri-intersection  solution  for  convex  sets. 

(a)  Set  S;  (b)  A  *  S  -  S;  (c)  formation  of  locator  set  L  - 
A  n(w  +  A);  (d)  formation  of  solution  B  =  A  n(wi  +  A)  n 
(w2  A);  (e)  and  (f)  two  other  solutions  of  the  form  B. 

6.  Sphere/circle  example. (a)  Set  S;  (b)  A  *  S  -  S;  (c)  B  * 

A  n (wi  +  A)  n  (w2  +  A);  (d)  another  solution  for  the  circle 
combining  the  solutions  (a)  and  (c). 

7.  Autocorrelation  tri-intersection  for  sets  consisting  of  a  col¬ 
lection  of  distinct  points,  (a)  Set  S;  (b)  A  «  S  -  S;  (c)  and 

(d)  locators  of  the  form  L  «  A  O  (w  +  A).  Intersecting  (c)  or 
(d)  with  (b)  yields  the  unique  solution  (a). 

8.  Autocorrelation  intersection  for  redundant  case,  (a)  Set  S; 

(b)  A  »  S  -  S;  (c),  (d),  and  (e)  locators  of  the  form  L  = 

A  n  (w  +  A);  (f)  intersection  of  (c)  with  (d);  (g)  another 
intersection  of  three  translates  of  A. 

9.  Redundancy-type  relationships  within  S  that  would  violate  Con¬ 
dition  1.  (a)  and  (b)  Three  vector  separations  add  to  zero; 

(c)  one  vector  separation  is  twice  another;  (d)  two  vector  sepa¬ 
rations  are  equal. 

Bl-BlO  (see  Text) 
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Figure  5.  Autocorrelation  tri-intersection  solution  for  convex  sets. 

(a)  Set  S;  (b)  A  -  S  -  S;  (c)  fonnation  of  locator  set  L  ■ 
A  n(w  +  A);  (d)  formation  of  solution  B  -  A  n(wi  +  A)  fi 
(w2  A);  (e)  and  (f)  two  other  solutions  of  the  form  B. 


V  w 


A  J*. 


Figure  7.  Autocorrelation  tri-intersection  for  sets  consisting  of  a  col¬ 
lection  of  distinct  points,  (a)  Set  S;  (b)  A  ■  S  -  S;  (c)  and 
(d)  locators  of  the  form  L  -  A  O  (w  +  A).  Intersecting  (c)  or 
(d)  with  (b)  yields  the  unique  solution  (a). 


(b) 


(c) 


s 


(d) 


(e) 


(f) 


(9) 


Figure  8.  Autocorrelation  intersection  for  redundant  case,  (a)  Set  S; 
(b)  A  ■  S  -  S;  (c),  (d),  and  (e)  locators  of  the  form  L  ■ 

A  0  (w  +  A);  (f)  intersection  of  (c)  with  (d);  (g)  another 
intersection  of  three  translates  of  A. 
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(c)  (d) 


Figure  9.  Redundancy- type  relationships  within  S  that  would  violate  Con¬ 
dition  1.  (a)  and  (b)  Three  vector  separations  add  to  zero; 

(c)  one  vector  separation  is  twice  another;  (d)  two  vector  sepa¬ 
rations  are  equal. 
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F03.  Comparison  of  Phase  Retrievial  Algorithms.*  J  R. 
KIENUP,  Radar  and  Optics  Dmsion,  Environmental  Research  In¬ 
stitute  of  Michigan,  P.O.  Rox  8618,  Ann  Arbor,  Mich  48107  — Dif¬ 
ferent  versions  of  the  iterative  Fourier  transform  method'  oi  recon¬ 
structing  an  object  from  the  modulus  of  its  Fourier  transform  (solving 
the  phase  retrieval  problem)  are  compared.  A  proof  of  convergence 
of  the  error  reduction  algorithm  is  given.  The  input-output  algo¬ 
rithms,  although  not  guaranteed  to  converge,  in  practice  converge 
faster  than  the  error  reduction  algorithm.  Other  algorithms  having 
accelerated  convergence  are  also  discussed.  The  steepest  descent 
method  is  shown  to  be  closely  related  to  the  iterative  Fourier  trans¬ 
form  method.  Application  of  these  methods  to  complicated  two- 
dimensional  objects  has  in  practice  usually  resulted  in  unique  re¬ 
constructions.  (13  min.) 

•  Research  supported  by  the  t'.-S.  Air  Force  Office  of  Scientific  Research. 

'  I.  R.  Fienop.  Opt.  Lett.  3, 27  (1978t. 


Reprinted  from:  J.  Opt.  Soc.  Am.  71,  1641  (1981).  Presented  at  the 
1981  Annual  Meeting  of  the  OpticaT”Society  of  America,  Kissimmee,  FL, 
October  1981. 
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COMPARISON  OF  PHASE  RETRIEVAL  ALGORITHMS 


by 

J.R.  Fienup 

Radar  and  Optics  Division 
Environmental  Research  Institute  of  Michigan 
P.O.  Box  8618,  Ann  Arbor,  MI  48107 


Abstract 


Iterative  algorithms  for  phase  retrieval  from  intensity  data  are 
compared  with  gradient  search  methods.  Both  the  problem  of  phase 
retrieval  from  two  intensity  measurements  (in  electron  microscopy  or 
wavefront  sensing)  and  the  problem  of  phase  retrieval  from  a  single 
intensity  measurement  plus  a  nonnegativity  constraint  (in  astronomy) 
are  considered,  with  emphasis  on  the  latter.  It  is  shown  that  both 
the  error-reduction  algorithm  for  the  problem  of  a  single  intensity 
measurement  and  the  Gerchberg-Saxton  algorithm  for  the  problem  of 
two  intensity  measurements  "converge.’’  The  error-reduction  algorithm 
is  also  shown  to  be  closely  related  to  the  steepest  descent  method. 
Other  algorithms,  including  the  input-output  algorithm  and  the  con¬ 
jugate  gradient  method  are  shown  to  converge  in  practice  much  faster 
than  the  error-reduction  algorithm.  Examples  are  shown. 
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1.  INTRODUCTION 


In  electron  microscopy,  wavefront  sensing,  astronomy,  crystallog¬ 
raphy,  and  other  fields,  one  often  wishes  to  recover  phase  although 
only  intensity  measurements  are  made.  One  is  usually  interested  in 
determining  an  object,  f(x),  which  is  related  to  its  Fourier  trans¬ 
form,  F(u),  by 

F(u)  -  lF(u)l  .  9rCf(x)3  ,  f  f(x)  dx  (1) 

mOO 

where  x  is  an  M-dimensional  spatial  coordinate  and  u  is  an 
M-dimensional  spatial  frequency  coordinate.  For  the  majority  of 
interesting  problems,  M  «  2.  In  practice,  one  deals  with  sampled 
data  in  the  computer  where,  for  the  t.<o-dimensional  case,  assuming 
square  arrays,  u  »  (u-j,  U2)  and  x  «  (Xp  X2),  where  Up 
U2,  Xp  and  X2  *  0,  1,2,  .  .  .,  N  -  1.  Then  one  uses  the 
discrete  Fourier  transform  (OFT) 

N-1 

F(u)  »  ^  f(x)  exp  (-i2itu»x/N)  (2) 

x«0 

and  its  inverse 

N-1 

f(x)  »  N“^  ^  F(u)  exp  (i2itu»x/N)  (3) 

u«0 

which  are,  of  course,  computed  using  the  fast  Fourier  transform  (FFT) 
method. 

For  the  problem  of  recovering  phase  from  two  intensity  measure¬ 
ments,  as  in  electron  microscopy  ano  in  wavefront  sensing, 

f(x)  »  lf(x)l  (4) 
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is  complex  valued  and  one  wishes  to  recover  'A(u),  or  equivalently 
recover  n(x)  from  measurements  of  both  lF(u)l  and  |f(x)|.  For  the 
problem  of  recovering  phase  from  a  single  intensity  measurement,  as 
in  image  recovery  from  speckle  interferometry  data  in  astronomy  and 
from  structure  factors  in  crystallography,  one  wishes  to  recover 
^(u),  or  equivalently  recover  f(x),  given  a  measurement  of  1f(u)1 
and  the  constraint  that  f(x)  be  real  and  nonnegative, 

f(x)  >  0  .  (5) 

A  particularly  successful  approach  to  solving  these  problems  is 

1  2 

the  use  of  the  Gerchberg-Saxton  algorithm  ’  and  related  algo- 
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rithms  .  Reference  7  lists  a  large  number  of  different  problems 
which  have  been  solved  by  these  algorithms.  These  algorithms  involve 
iterative  Fourier  transformation  back  and  forth  between  the  object 
domain  and  the  Fourier  domain,  and  application  of  the  measured  data 
or  known  constraints  in  each  domain.  ‘ 

In  what  follows,  a  generalized  Gerchberg-Saxton  algorithm,  re¬ 
ferred  to  as  the  error-reduction  algorithm,  and  its  convergence 
properties  are  reviewed  (Section  2),  and  it  is  shown  to  be  equivalent 
to  the  steepest-descent  gradient  search  method  (Section  3).  Alterna¬ 
tive  gradient  search  methods  (Section  4)  and  iterative  Fourier  trans¬ 
form  algorithms  (Section  5)  are  described,  and  are  shown  to  converge 
much  faster  than  the  error-reduction  algorithm  for  the  problem  of  a 
single  intensity  measurement  (Section  6).  Some  practical  considera¬ 
tions  are  discussed  in  Section  7.  A  typical  reconstruction  experi¬ 
ment  is  shown  in  Section  8,  and  the  major  conclusions  are  summarized 
in  Section  9. 


2.  THE  ERROR-REDUCTION  ALGORITHM 

The  Gerchberg-Saxton  algorithm  was  originally  invented  in  connec¬ 
tion  with  the  problem  of  reconstructing  phase  from  two  intensity 
1  2 

measurements  ’  (and  for  synthesizing  phase  codes  given  intensity 
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constraints  in  each  of  two  domains  ’).  The  algorithm  consists  of 
the  following  four  simple  steps:  (1)  Fourier  transform  an  estimate 
of  the  object,  (2)  replace  the  modulus  of  the  resulting  computed 
Fourier  transform  with  the  measured  Fourier  modulus  to  form  an  esti¬ 
mate  of  the  Fourier  transform,  (3)  inverse  Fourier  transform  the 
estimate  of  the  Fourier  transform  and  (4)  replace  the  modulus  of  the 
resulting  computed  image  with  the  measured  object  modulus  to  form  a 
new  estimate  of  the  object.  In  equations,  this  is,  for  the 
iteration. 


G^(u)  . 

|G,^(u)|  exp  Ci^,^(u)]  *9'^Cg|^(x)] 

(6) 

G,;(u) . 

|F(u)|  exp  Ci0,^(u)] 

(7) 

gi;(x)  . 

|g,^(x)|  exp  Ci®,^(x)]  »  9r“^[G,^(u)] 

(8) 

■ 

(f(x)|  exp  Ci©,^+^(x)]  =.  jf(x)|  exp  Ci&|^(x)] 

(9) 

where  gj^,  9^,  Gj^  and  are  estimates  of  f,  n,  F  and  respectively. 
Here  and  throughout  this  paper,  functions  represented  by  upper-case 
letters  are  the  Fourier  transforms  of  the  functions  represented  by 
the  corresponding  lower-case  letters. 

The  Gerchberg-Saxton  algorithm,  as  depicted  in  Figure  1,  is 
easily  generalized  to  a  large  class  of  problems®’^.  The  general¬ 
ized  Gerchberg-Saxton  algorithm  can  be  used  for  any  problem  in  which 
partial  constraints  (in  the  form  of  measured  data  or  information 
known  a  priori)  are  known  in  each  of  two  domains,  usually  the  object 
(or  image)  domain  and  the  Fourier  domain.  One  simply  transforms  back 
and  forth  between  the  two  domains,  satisfying  the  constraints  in  each 
domain  before  returning  to  the  other  domain.  This  generalization  of 
the  Gerchberg-Saxton  algorithm  will  be  referred  to  as  the  error- 
reduction  algorithm,  since,  as  will  be  shown  below,  the  error  de¬ 
creases  at  each  iteration. 


For  the  most  general  problem,  the  error-reduction  algorithm  con¬ 
sists  of  the  following  four  steps:  (1)  Fourier  transform  g|^(x), 
an  estimate  of  f(x);  (2)  make  the  minimum  changes  in  6|^(u),  the 
resulting  computed  Fourier  transform,  which  allow  it  to  satisfy  the 
Fourier-domain  constraints,  to  form  G|^(u),  an  estimate  of  F(u); 

(3)  inverse  Fourier  transform  6||(u);  and  (4)  make  the  minimum 
changes  in  g|^(x),  the  resulting  computed  image,  which  allow  it  to 
satisfy  the  object-domain  constraints,  to  form  g|^^.^(K),  a  new  esti¬ 
mate  of  the  object.  In  particular,  for  the  problem  of  a  single  in¬ 
tensity  measurement  (as  in  astronomy),  the  first  three  steps  are 
identical  to  the  first  three  steps  of  the  Gerchberg-Saxton  algorithm, 
Eqs.  (6),  (7)  and  (8);  and  the  fourth  step  is  given  by 

fg^Cx),  X  ^  Y 

(10) 

0,  X  c  Y 

where  y  is  the  set  of  points  at  which  g||(x)  violates  the  object- 
domain  constraints,  i.e,,  wherever  g|^{x)  is  negative  or  (optionally) 
where  it  exceeds  the  known  diameter  of  the  object.  The  diameter  of 
the  object  can  be  computed,  since  it  is  just  half  the  diameter  of 
the  autocorrelation  function,  which  is  the  inverse  Fourier  transform 
of  the  squared  Fourier  modulus.  (However,  in  two  dimensions,  the 
exact  support  of  the  object  cannot  in  general  be  determined  uniquely 
from  the  support  of  its  autocorrelation^®,  and  so  the  diameter 
constraint  cannot  be  applied  very  tightly.) 

The  iterations  continue  until  the  computed  Fourier  transform 
satisfies  the  Fourier-domain  constraints  or  the  computed  image  satis¬ 
fies  the  object-domain  constraints;  then  one  has  found  a  solution,  a 
Fourier  transform  pair  that  satisfies  all  the  constraints  in  both 
domains.  The  convergence  of  the  algorithm  can  be  monitored  by  com¬ 
puting  the  squared  error.  In  the  Fourier  domain,  the  squared  error 
is  the  sum  of  the  squares  of  the  amounts  by  which  G|^(u),  the  com¬ 
puted  Fourier  transform,  violates  the  Fourier-domain  constraints. 
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Since  G|^(u)  was  formed  from  G|^(u)  by  making  the  minimum  changes 
in  order  to  satisfy  the  Fourier-domain  constraints,  the  squared  error 
can  be  expressed  as 


iG^(u)  -  G'(u)|2  (11) 

u 

which,  for  both  problems  being  considered,  can  be  expressed  as 


[lS^(u)l  -  1f(u)|]2 


u 


(12) 


In  this  section,  the  symbol  Epj^  is  used  to  distinguish  it  from 

the  object-domain  error,  Eqj^,  described  below.  For  economy  of 

notation  in  the  section  of  this  paper  dealing  with  the  gradient 

2 

methods,  the  symbol  Bj^  is  used  instead  of  ^Fk*  The  symbol  B 
(with  the  subscript  k  deleted)  is  given  by  Eq.  (11)  with  G  and  G' 
replacing  Gj^  and  Gj^,  respectively. 

Similarly,  for  the  error-reduction  algorithm,  the  squared  error 
in  the  object  domain  can  be  expressed  as 


X 

which,  for  the  problem  of  two  intensity  measurements  can  be  expressed 
as 

4-ZcifU)|- |9i;(x)|]2  (14) 

X 


and  for  the  problem  of  a  single  intensity  measurement  can  be  ex¬ 
pressed  as 


where  y  is  defined  as  in  Eq.  (10).  The  asymmetry  in  the  use  of  the 
_2 

N  factor  above  was  chosen  because  of  the  similar  asymmetry  in 
the  definition  of  the  discrete  Fourier  transform  in  Eqs.  (2)  and  (3). 
When  the  squared  error  is  zero,  then  a  solution  has  been  found. 

In  the  following,  the  error-reduction  algorithm  is  shown  to  con¬ 
verge,  and  this  convergence  property  holds  for  all  the  applications 
of  the  error-reduction  algorithm  (not  just  the  problems  of  recovering 
phase  from  a  single  or  from  two  intensity  measurements). 

For  the  general  problem,  at  the  iteration,  the  squared 
error  in  the  Fourier  domain  is  given  by  Eq.  (11).  By  Parseval's 
theorem^  ^ , 

u 

X 

Now  compare  this  with  Eq.  (13),  the  error  in  the  object  domain.  Both 
g|^(x)  and  g|^+i(x)  by  definition  satisfy  the  object-domain  con¬ 
straints.  Also,  at  any  point  x,  by  definition  g|^^.i(x)  is  the  near¬ 
est  value  to  gj^(x)  that  satisfies  the  object-domain  constraints. 
Therefore,  at  all  points  x, 

|g,^+l(x)  -  g,[(x)|  <  |g,^(x)  -  g|^(x)|  (17) 

and  therefore,  from  Eqs.  (13)  and  (16), 

^Ok  -  ^Fk  ' 

Similarly,  by  Parseval's  theorem. 
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^Ok  •  S 

X 


Since  both  G|^(u)  and  Si;+l(u)  satisfy  the  Fourier  domain  constraints, 
and  at  any  point  u,  by  definition  *’k+l^  u)  is  the  nearest  value  to 
G|j+^(li)  that  satisfies  the  Fourier -domain  constraints,  then 

|S„+l(u)  -G;;+^(u)l  <  |G^+i(u)  -G^(u)l  .  (20) 


Therefore,  from  Eqs.  (11)  and  (19), 

^F,k+1  - 

and  combining  this  with  Eq..  (18)  gives  the  desired  result 

i  4  i  4  •  <22) 

That  is,  the  error  can  only  decrease  (or  stay  the  same)  at  each 
iteration. 


In  practice,  the  error -reduction  algorithm  usually  decreases  the 

error  rapidly  for  the  first  few  iterations,  but  then  much  more  slowly 

1  2  9  12 

for  later  iterations  The  speed  of  convergence  also  de¬ 

pends  on  the  type  of  constraints  imposed.  Convergence  seems  to  be 
reasonably  fast  for  the  problem  of  two  intensity  measurements,  but 
painfully  slow  for  the  problem  of  a  single  intensity  measurement. 
Figure  2  shows  an  example  of  the  error  as  a  function  of  the  number 
of  interations  of  the  error-reduction  algorithm  for  the  problem  of  a 
single  intensity  measurement.  Shown  is  the  normalized  rms  error, 
i.e.,  the  square  root  of  Ep|^  divided  by^J  F(u)|  Both 
the  error  and  the  iteration  number  are  shown  on  logarithmic  scales. 
The  error  decreases  rapidly  during  the  first  30  iterations,  but  then 
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reaches  a  plateau  at  a  level  of  0.16,  decreasing  very  slowly.  After 
70  iterations,  the  error  again  starts  to  decrease  until  it  reaches  a 
second  plateau  at  the  level  of  0.02,  at  which  point  the  error  de¬ 
creases  extremely  slowly.  After  2,000  iterations,  the  error  once 
again  starts  to  decrease  until  it  reaches  a  third  plateau  at  the 
level  of  0.003,  at  which  point  the  decrease  in  the  error  is 
negligible. 

The  occurrences  of  plateaus  during  which  convergence  is  extremely 

1  9  12 

slow  seems  to  occur  more  often  than  not  for  both  applications  ’  ’  , 

and  it  has  in  the  past  been  observed  that  with  persistence,  one  can 

go  beyond  the  plateau  region  and  again  make  rapid  progress  toward  a 
1  12 

solution  ’  .  However,  as  shown  in  the  example  of  Figure  2  for 
the  problem  of  a  single  intensity  measurement,  the  number  of  itera¬ 
tions  required  for  convergence  of  the  error-reduction  algorithm  can 
be  extremely  large,  making  that  algorithm  unsatisfactory  for  that 
application.  Fortunately,  as  will  be  shown  in  later  sections,  other 
related  algorithms  converge  much  faster,  reconstructing  a  recogniza¬ 
ble  image  in  20  or  30  iterations,  and  completing  the  reconstruction 
in  under  one  hundred  iterations,  which  takes  less  than  two  minutes 
on  a  Floating  Point  System  AP-120B  array  processor  for  array  sizes 
of  128  X  128. 

3.  THE  STEEPEST  DESCENT  METHOD 

An  alternative  approach  to  solving  the  phase  retrieval  problems 
is  to  employ  one  of  the  gradient  search  methods.  In  this  section, 
it  is  shown  that  one  such  method,  the  steepest  descent  method,  is 
closely  related  to  the  error  reduction  algorithm  for  the  problem  of 
reconstructing  phase  from  a  single  intensity  measurement.  The  rela¬ 
tionship  between  the  steepest  descent  method  and  the  Gerchberg-Saxton 
algorithm  is  also  discussed. 
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An  example  of  how  a  gradient  search  method  would  be  used  for  this 

2 

problem  is  the  following.  One  can  define  B  *  Ep,  the  squared 
error  in  the  Fourier  domain  given  by  Eq.  (12),  as  the  error  metric 
which  one  seeks  to  minimize  by  varying  a  set  of  parameters.  Here, 
the  N  values  of  g(x),  the  estimate  of  f(x),  are  treated  as  N 
independent  parameters.  Starting  at  a  given  point,  g|^(x),  in  the 
N^-dimensional  parameter  space,  one  would  reduce  the  error  by  com¬ 
puting  the  partial  derivatives  of  B  with  respect  to  each  of  the 
points  g(x)  (N  partial  derivatives),  forming  the  gradient,  and 
then  move  from  g|^(x)  in  a  direction  opposite  that  of  the  gradient 
to  a  new  point  g|||(x).  One  would  then  form  a  new  estimate,  g|j+i(x), 
from  g|^'(x)  by  forcing  the  object-domain  constraints  to  be  satisfied. 
This  would  be  done  iteratively  until  a  minimum  (hopefully  a  global 

minimum)  is  found.  That  is,  one  minimizes  the  error,  B,  as  a  func- 
2 

tion  of  the  N  parameters,  g(x),  subject  to  the  object-domain 
constraints. 

2 

Ordinarily,  the  computation  of  the  N  partial  derivatives  would 
be  a  very  lengthy  task  since  each  evaluation  of  B  involves  an  N  x  N 
discrete  Fourier  transform.  However,  for  the  problems  considered 
here,  the  computation  can  be  greatly  reduced,  as  described  below. 

First,  consider  the  problem  of  a  single  intensity  measurement. 

The  partial  derivative  of  B  with  respect  to  a  value  at  a  given  point, 
g(x),  is  [g(x)  is  assumed  to  be  real  since  f(x)  is  real],  using 

Eq.  (12), 

’5®  "  IgW  ■  ns(u)|  -  |FW)|]  •  (“) 

u 

Later,  the  notation  a  8,,  will  be  used  to  denote  a„B  evaluated 

g  k  g 

at  g(x)  »  g|^(x).  Since 
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(24) 


C-i2iTU.y/N] 

y 


then 


«  exp  [-i2iru.x/N]  , 


al  G(u)| 
3g(x) 


3g(x)  2  jG{u)l 


6(u)  exp  [i2iru*x/N]  +  6 
2|G(u 


alGfu)!^ 

3g(x) 

(u)  exp  [-i2Tru»x/N] 


Therefore,  Eq.  (23)  becomes 


(25) 


3gB  -  N"^^CG(u)  -  lF(u)l6(u)/lG(u)l]  exp  [i2iTU.x/N] 
u 

+  -  |P(u)Ig*(u)/1g(u)1]  exp  [-i2iTU.x/N]  .(26) 

u 

Using  Eqs.  (6)  and  (7)  to  define  6'(u)  as 

G'(u)  .  1f(u)|G(u)/|G(u)1  ,  (27) 

and  noting  that  Eq.  (26)  is  in  the  form  of  a  discrete  Fourier  trans¬ 
form,  it  can  be  reduced  to 

3g8  -  2[g(x)  -  g'(x)]  (28) 

where  g'(x)  is  defined  by  Eq.  (8),  and  the  fact  that  g(x)  and  g'(x) 
are  real  valued  has  been  used.  [Note  that  since  3g'(y)/3g(x)  ^  0, 
it  is  not  true  that  Eq.  (28)  follows  immediately  from  Eq.  (16).] 

From  Eq.  (28),  it  is  evident  that  the  entire  gradient,  consisting  of 
the  N  partial  derivatives,  can  be  computed  very  simply  by  Fourier 
transforming  g(x),  applying  Eq.  (27),  inverse  Fourier  transforming 
to  arrive  at  g'(x),  subtracting  g'(x)  from  g(x)  and  multiplying  by  a 
constant.  In  fact,  the  computation  of  g'(x)  is  identical  to  the 
first  three  steps  of  the  error-reduction  algorithm. 
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The  "optimum"  step  size  to  take  in  the  direction  of  the  gradient 
can  be  determined  by  forming  a  first-order  Taylor  series  expansion 
of  B  as  a  function  of  g(x),  about  the  point  g|^(x), 

B  “  ^  agBj^[g(x)  -  g,^(x)]  .  (29) 

X 

This  first-order  expansion  of  B  is  equal  to  zero  at  g(x)  « 
given  by 

B  d  B 

g;(x)  -  9k('<)  -  -  .  (30) 

^(»g8k> 

Which  can  be  easily  verified  by  inserting  Eq.  (30)  into  Eq.  (29). 
Since  by  Eqs.  (28)  and  (16). 

y  y 

Eq.  (30)  becomes 

gi;!(x)  -  g,^(x)  >  -(l/^)3gB,^ 

»  (l/2)Cg,;(x)  -  g^(x)]  .  (32) 

However,  since  B  is  quadratic  in  g(x),  the  linear  approximation  above 

can  be  expected  to  predict  a  step  size  half  as  large  as  the  opti- 

2 

mum  .  Therefore,  one  should  use  the  double-length  step, 

g||.'(x)  -  g,^(x)  «  [g,^(x)  -  g,j(x)] 


or 

g{!(x)  »  g,J(x)  .  (33) 

In  fact,  since  iGj^(u)  j  »  lF(u)l,  moving  to  g.^(x)  reduces  the  error, 
Eq.  (12),  to  zero  exactly.  As  a  final  step  in  one  iteration,  the 
new  estimate  should  be  made  to  satisfy  the  object-domain  constraints, 
which  is  accomplished  by  using  Eq.  (10). 
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Comparing  this  new  estimate  to  that  of  the  error-reduction  algo¬ 
rithm  described  in  Section  2,  it  is  seen  that  they  are  identical. 

That  is,  the  error-reduction  iterative  Fourier  transform  algorithm 
can  be  looked  on  as  a  rapid  method  of  implementing  a  double-length 
step  steepest  descent  method. 

Although  the  steepest  descent  method  is  identical  to  the  error- 
reduction  algorithm  for  the  problem  of  a  single  intensity  measure¬ 
ment,  the  connection  is  not  so  close  for  the  problem  of  two  intensity 
measurements,  as  explored  in  Appendix  A.  In  that  case,  the  error  is 
minimized  with  respect  to  the  phase  estimate  ©(x),  and  the  derivative 
of  the  error  does  move  one  in  a  direction  approximately  towards  the 
Gerchberg-Saxton  phase,  ©'(x),  of  Eq.  (9).  However,  the  direction, 
according  to  Eq.  (A-4),  is  somewhat  different  than  that  of  the 
Gerchberg-Saxton  algorithm;  and  the  step  size,  according  to 
Eqs.  (A-12)  and  (A-16)  is  considerably  larger  than  that  of  the 
Gerchberg-Saxton  algorithm.  Experimental  results  using  the  steepest 
descent  method  for  the  problem  of  two  intensity  measurements  are 
shown  in  Reference  2. 

The  relationship  between  a  gradient  search  method  and  the  error- 
reduction  algorithm  for  a  problem  in  digital  filter  design  is  dis¬ 
cussed  in  Reference  13. 

4.  OTHER  GRADIENT  SEARCH  METHODS 

As  shown  in  Section  3  for  the  phase  problem  of  a  single  intensity 
measurement,  the  steepest  descent  method  is  equivalent  to  the  error- 
reduction  algorithm.  And  as  described  in  Section  2,  although  there 
is  a  convergence  proof  for  the  error-reduction  algorithm,  it  in  prac¬ 
tice  converges  very  slowly  for  the  problem  of  a  single  intensity 
measurement.  Slow  convergence  of  the  steepest  descent  (also  known 
as  the  optimum  gradient)  method  has  also  been  observed  for  other 
applications  as  well^^.  In  this  section,  some  other  gradient 
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search  methods  are  briefly  described,  and  in  Section  6  it  will  be 
shown  that  in  practice  they  converge  much  faster  than  the  steepest 
descent  method  for  this  problem. 

Recall  from  Eq.  (30)  that  the  steepest  descent  method  moves  from 
the  point  g|j(x)  in  parameter  space  to  the  point 

g|;;(x)  -  g|^(x)  -  h^a^B^  (34) 

where  h|^,  the  step  size,  is  a  positive  constant  and  the  gradient 
is  given  by  Eq.  (28).  For  many  applications,  one  would  search  along 
the  direction  of  -agB|^,  evaluating  B  repeatedly  until  the  value 
of  h|^  that  minimizes  B  is  found;  then  from  that  point,  one  would 
recompute  the  gradient  and  go  off  in  a  new  direction.  For  this 
application,  however,  since  an  N  x  N  Fourier  transform  is  required 
to  evaluate  B  and  only  one  more  Fourier  transform  is  required  to 
recompute  the  gradient,  one- may  as  well  recompute  the  gradient  at 
each  step.  After  each  step  of  this  or  the  other  gradient  methods 
described  later,  one  must  then  satisfy  the  object-domain  constraints 
to  form  the  new  estimate  as  was  done  in  Eq.  (10), 

;  gj^(x),  X  i  r 

g|^+l(x)  -  <  (35) 

*  0,  X  e  T 

where  the  set  y  is  defined  as  in  Eq.  (10).  In  Section  3,  the 
"optimum”  double-length  step  value  of  h|^  was  shown  to  be  unity, 
for  which  the  steepest  descent  method  is  equivalent  to  the  error- 
reduction  algorithm.  In  fact,  1  leads  to  a  point 
at  which  B  =  0.  This  is  not  a  solution,  however,  unless  gjj(x) 
satisfies  the  object-domain  constraints.  With  this  in  mind,  other 
values  of  h|^  are  in  practice  better,  as  will  be  shown  in  Section  6. 

A  useful  block  diagram  representation  of  this  and  other  gradient 
methods  is  shown  in  Figure  3,  which  emphasizes  the  similarity  of 
gradient  methods  to  the  error -reduction  algorithm.  The  first  three 


g|^(x)  -  g,^'(x) 


steps  of  the  error-reduction  algorithm,  resulting  in  the  computation 
of  g'(x),  does  most  of  the  work  of  computing  the  gradient.  The  final 
step  of  satisfying  the  object-domain  constraints  is  common  to  gradi¬ 
ent  methods  and  the  error-reduction  algorithm.  Therefore,  one  can 
think  of  the  error-reduction  algorithm  as  a  special  case  of  a  more 
general  class  of  gradient  methods.  For  the  error-reduction  algorithm 
(or  the  double-length  step  steepest  descent  method),  it  just  happens 
that  g|^'(x)  =  g,^'(x). 

A  gradient  search  method  that  is  superior  to  the  steepei-t  descent 

method  is  the  conjugate  gradient  method.  For  that  method,  Eq.  (34) 
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is  replaced  by 

gi;,'(x)  »  g,^(x)  +  \D|^(x)  (36) 

where  the  direction,  0|^(x),  is  given  by 

-  -('/2)>gB,  *  r  ■  (32) 

L  y  /  y 

which,  using  Eqs.  (28)  and  (31)  can  be  written  as 

0|^(x)  »  g^(x)  -  g,^(x)  +  )D^_^ (x)  (38) 

where  one  would  start  the  first  iteration  with  D^(x)  =  g](x)  -  gi(x). 
After  using  Eq.  (36),  one  would  employ  Eq.  (35)  to  form  the  new  esti¬ 
mate,  as  indicated  in  Figure  3. 

Numerous  variations  on  these  gradient  methods  are  possible.  For 
example,  one  could  argue  that  from  one  iteration  to  the  next,  the 
solution  is  going  in  the  following  direction, 

D,^(x)  =  g,^(x)  -  g,^_-i(x)  (39) 

Since  the  step  in  that  direction  may  be  too  small,  a  better  point  to 
go  to  would  be 


gi;!(x)  -  g|^(x)  +  h|^[gk(x)  -  gk_i(’^)3  (^o) 

wtiere  the  parameter  hj^  controls  the  step  size.  In  Eq.  (40),  one 
jumps  from  the  point  g|^(x)  rather  than  from  gj^(x)  since  presumably 
g|^(x)  is  closer  to  the  solution  than  g|^(x).  After  using  Eq.  (40), 
one  would  employ  Eq.  (35)  to  form  the  new  estimate. 

A  method  that  does  not  seem  as  practical  for  this  problem  is  that 
of  (damped)  least  squares  (or  Newton-Raphson)^^"^^.  Since  each 
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iteration  of  a  least-squares  method  involves  the  inversion  of  N 

2 

by  N  matrix,  a  large  number  of  iterations  of  one  of  the  gradient 
methods  or  of  one  of  the  iterative  Fourier  transform  methods 
described  in  the  next  section  could  be  performed  in  the  same  time  it 
takes  to  do  a  single  iteration  of  least  squares.  Furthermore,  as 
has  been  discussed  above,  one  can  readily  find  a  point  g|^'(x)  *  gi;'(^) 
at  which  the  error  B  is  equal  to  zero,  and  so  a  more  sophisticated 
(and  more  difficult)  method*  such  as  least  squares,  of  finding  such 
a  point  is  not  warranted. 

The  problem  here  is  that  one  is  constantly  running  into  the 
object-domain  constraints  on  g(x).  An  approach  that  would  be 
superior  to  the  ones  considered  here  would  be  one  that  minimizes  the 
Fourier-domain  error  while  inherently  satisfying  the  object-domain 
constraints,  or  one  that  minimizes  an  error  metric  that  combines  the 
Fourier-domain  constraints  and  the  object-domain  constraints.  An 
example  of  the  former  is  the  use  of  a  gradient  search  method  for  the 
problem  of  two  intensity  measurements;  by  searching  over  e(x),  one 
automatically  satisfies  the  object-domain  constraint  that  it  have  a 
given  modulus,  |f(x)  ].  Something  along  these  lines  would  be  very 
useful  for  the  problem  of  a  single  intensity  measurement;  clearly, 
more  could  be  done  in  this  area. 

5.  INPUT-OUTPUT  ALGORITHMS 

A  solution  to  the  problem  of  the  slow  convergence  of  the  error- 
reduction  algorithm  has  been  the  input-output  algorithm,  which  has 
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proven  to  converge  faster  for  both  the  problem  of  two  intensity  mea¬ 
surements  and  the  problem  of  a  single  intensity  measurement^’^. 
The  input-output  algorithm  differs  from  the  error-reduction  algorithm 
only  in  the  object-domain  operation.  The  first  three  operations  — 
Fourier  transforming  g(x),  satisfying  the  Fourier  domain  constraints, 
and  inverse  Fourier  transforming  the  result  —  are  the  same  for  both 
algorithms.  Those  three  operations,  if  grouped  together  as  indicated 
in  Figure  4,  can  be  thought  of  as  a  nonlinear  system  having  an  input 
g(x)  and  an  output  g'(x).  The  useful  property  of  this  system  is  that 
its  output  is  always  an  image  having  a  Fourier  transform  that  satis¬ 
fies  the  Fourier-domain  constraints.  Therefore,  if  the  output  also 
satisfies  the  object-domain  constraints,  then  it  is  a  solution  to 
the  problem.  Unlike  the  error-reduction  algorithm  and  the  gradient 
methods,  the  input  g(x)  no  longer  must  be  thought  of  as  the  current 
best  estimate  of  the  object;  instead,  it  can  be  thought  of  as  the 
driving  function  for  the  next  output,  g'(x).  The  input  g(x)  does 
not  necessarily  satisfy  the  object-domain  constraints.  This  view¬ 
point  allows  one  a  great  deal  of  flexibility  and  inventiveness  in 
selecting  the  next  input,  and  allows  for  the  invention  of  algorithms 
that  converge  more  rapidly  to  a  solution.  The  input-output  algo¬ 
rithm,  then,  is  actually  a  class  of  algorithms,  as  will  be  described 
below. 

As  described  elsewhere®’^’^^,  it  has  been  found  that  a  small 
change  of  the  input  result  in  a  change  of  the  output  in  the  same 
general  direction  as  the  change  of  the  input.  More  precisely,  for  a 
small  change  of  the  input,  the  expected  value  of  the  corresponding 
change  of  the  output  is  a  constant,  a,  times  the  change  of  the  input. 
Since  additional  nonlinear  terms  also  appear  in  the  output,  the 
change  of  the  output  due  to  a  particular  change  of  the  input  cannot 
be  predicted  exactly.  Nevertheless,  by  appropriate  changes  of  the 
input,  the  output  can  be  pushed  in  the  general  direction  that  is  de¬ 
sired.  If  a  change  Ag(x)  is  desired  in  the  output,  then  a  logical 


choice  of  the  change  of  the  input  to  achieve  that  change  of  the  out¬ 
put  would  be  s&g(x),  where  0  is  a  constant  ideally  equal  to‘a“^. 

For  the  problem  of  phase  retrieval  from  a  single  intensity  mea¬ 
surement,  the  desired  change  of  the  output  is 

0,  X  /  Y 

Ag,^(x)  =  (41) 

I  X  e  Y 

where  y  is  the  set  of  points  at  which  g|^(x)  violates-the  object- 
domain  constraints.  That  is,  where  the  constraints  are  satisfied, 
one  does  not  require  a  change  of  the  output;  but  where  the  con¬ 
straints  are  violated,  the  desired  change  of  the  output,  in  order  to 

have  it  satisfy  the  object-domain  constraints,  is  one  that  drives  it 
to  a  value  of  zero  (and  therefore,  the  desired  change  is  the  negative 
of  the  output  at  those  points).  Therefore,  a  logical  choice  for  the 
next  input  is 


We  will  refer 
algorithm. 


An  interesting  property  of  the  nonlinear  system  (consisting  of 
the  set  of  three  steps  mentioned  above)  is  that  if  an  output  g'  is 
used  as  an  input,  then  its  output  will  be  itself.  Since  the  Fourier 
transform  of  g'  already  satisfies  the  Fourier -domain  constraints,  g' 
is  unaffected  as  it  goes  through  the  system.  Therefore,  irrespective 
of  what  input  actually  resulted  in  the  output  g',  the  output  g'  can 
be  considered  to  have  resulted  from  itself  as  an  input.  From  this 
point  of  view,  another  logical  choice  for  the  next  input  is 
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(43) 


g|^+l(x)  »  g,|(x)  +  ftAg^(x) 


gi|(x),  X  y  Y 

a 

g||{x)  -  ag|^(x),  X  e  Y 

We  will  refer  to  the  use  of  Eq.  (43)  as  the  output-output  algorithm. 

Note  that  if  a  *  1  in  Eq.  (43),  then  the  output-output  algorithm 
reduces  to  the  error-reduction  algorithm  of  Eq.  (10).  Since  the 
optimum  value  of  a  is  usually  not  unity,  the  error-reduction  algo¬ 
rithm  can  be  looked  on  as  a  sub-optimal  version  of  a  more  general 
approach. 


Still  another  method  of  choosing  the  next  input  which  was  inves¬ 
tigated  is  a  combination  of  the  upper  line  of  Eq.  (43)  with  the  lower 
line  of  Eq.  (42): 


gk.,(x)  . 


gk(x),  X  y  Y 

gi^(x)  -  3g|^(x),  X  e  Y 


(44) 


We  will  refer  tc  the  use  of  Eq.  (44)  as  the  hybrid  input-output 
algorithm.  The  hybrid  input-output  algorithm  is  an  attempt  to  avoid 
a  stagnation  problem  that  tends  to  occur  with  the  output-output 
algorithm.  The  output-output  algorithm  often  works  itself  into  a 
situation  in  which  the  output  on  successive  iterations  does  not 
change,  despite  being  far  from  a  solution.  For  the  hybrid  input- 
output  algorithm,  on  the  other  hand,  if  at  a  given  value  of  x,  the 
output  remains  negative  for  more  than  one  iteration,  then  the  cor¬ 
responding  point  in  the  input  continues  to  grow  larger  and  larger 
until  eventually  that  output  value  must  go  nonnegative. 

For  the  input-output  algorithms,  the  error  Ep  is  usually  mean¬ 
ingless  since  the  input  g|^(x)  is  no  longer  an  estimate  of  the  ob¬ 
ject.  Then  the  meaningful  error  is  the  object-domain 'error,  E^, 
given  by  Eq.  (15). 
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For  the  problem  of  phase  retrieval  from  two  intensity  measure¬ 
ments,  the  desired  change  of  the  output  takes  a  different  form,  and 
it  is  described  elsewhere^’^’^^  in  the  context  of  a  computer  holog¬ 
raphy  synthesis  problem  involving  intensity  constraints  in  each  of 
the  two  domains. 

6.  EXPERIMENTAL  COMPARISON  OF  PHASE  RETRIEVAL  ALGORITHMS 

In  this  se  on  the  gradient  search  and  input-output  algorithms 
are  compared  Tl.  the  problem  of  phase  retrieval  from  a  single  inten¬ 
sity  measurement,  by  using  them  all  on  the  same  Fourier  modulus  data 
and  with  the  same  starting  input.  For  each  approach,  several  differ¬ 
ent  values  of  the  algorithm  parameter  (h  or  s)  were  tried.  The  prin¬ 
cipal  problem  with  the  error-reduction  algorithm  is  that  it  tends  to 
stagnate  after  a  few  iterations.  For  this  reason,  the  starting  point 
for  the  iterations  was  chosen  to  be  a  partially  reconstructed  image 
on  which  the  error-reduction  algorithm  was  making  slow  progress. 
Figure  5  shows  a  plot  of  E^,  the  rms  error,  versus  the  number  of 
iterations  beyond  this  starting  point,  using  the  error-reduction  al¬ 
gorithm.  Starting  at  0.071275,  E^  decreased  slowly  but  steadily 
to  0.067470  after  ten  iterations  and  0.063373  after  19  iterations. 

In  this  paper,  all  values  of  E  are  normalized  by  dividing  by  the 

9  ^ 

square  root  ofJ^Cg' (x)]*^,  the  total  image  energy.  The  object  for 
the  experiment  described  in  this  section  is  a  digitized  photograph 
of  a  satellite  in  a  field  of  view  of  128  x  128  pixels,  and  its 
Fourier  modulus  is  noise-free. 

The  algorithms  were  compared  by  performing  ten  iterations  of  each 
algorithm,  followed  by  nine  iterations  of  the  error-reduction  algo¬ 
rithm  (a  total  of  19  iterations)  using  the  same  starting  input  for 
each.  During  the  first  ten  iterations,  the  value  of  the  algorithm 
parameter  s  or  h  was  held  constant.  The  reason  that  each  algorithm 
was  followed  by  nine  iterations  of  the  error-reduction  algorithm  is 
as  follows.  In  many  cases,  it  has  been  observed  that  definite 


progress  (i.e.,  improved  visual  quality  of  the  output  image)  is  being 
made  with  an  input-output  algorithm  even  though  decreases  very 
little  or  even  increases  with  each  iteration.  The  relationship  be¬ 
tween  Eq  and  the  visual  image  quality  is  not  fully  understood, 
although,  of  course,  one  would  expect  a  high  degree  of  correlation 
between  the  two.  For  those  cases  for  which  the  visual  quality  im¬ 
proves  while  Eq  does  not,  it  was  found  that  if  one  then  performs  a 
few  (say  five  or  ten)  iterations  of  the  error-reduction  algorithm, 
then  the  visual  quality  of  the  output  image  changes  very  little,  but 
Eq  decreases  rapidly  until  it  becomes  more  consistent  with  the 
visual  image  quality.  Therefore,  in  order  to  gauge  the  true  progress 
of  an  input-output  algorithm  using  the  value  of  E^,  a  few  itera¬ 
tions  of  the  error-reduction  algorithm  are  performed  after  the  itera¬ 
tions  of  the  input-output  algorithm.  For  this  reason,  for  all  cases 
the  ten  iterations  of  the  algorithm  being  tested  were  followed  by 
nine  iterations  of  the  error-reduction  algorithm  in  order  to  make  a 
fair  comparison. 

Plots  of  Eq  after  the  set  of  19  iterations  described  above  (ten 
iterations  followed  by  nine  iterations  of  the  error-reduction  algo¬ 
rithm)  are  shown  in  Figure  6  as  a  function  of  the  algorithm  parameter 
for  the  various  algorithms.  Note  that  both  the  steepest  descent 
method  with  h  *  1 .0  and  the  output-output  algorithm  with  6  =  1.0  are 
equivalent  to  the  error-reduction  algorithm,  at  E^  =  0.063373,  both 
circled  in  Figure  6.  Comparing  these  plots,  it  is  seen  that  the 
algorithm  which  most  reduced  the  error  after  the  set  of  19  iterations 
is  the  hybrid  input-output  algorithm  with  a  value  of  s  equal  to  about 
unity. 

For  each  algorithm,  the  use  of  a  small  algorithm  parameter  (s  or 
h)  leads  to  a  steady  but  slow  decline  of  E^.  Increasing  the  value 
of  the  parameter  increases  the  speed  at  which  E^  decreases,  until 
one  reaches  a  point  where  the  parameter  is  too  large  and  the  algo¬ 
rithm  becomes  unstable.  The  instability  of  the  algorithm  for  larger 


values  of  the  algorithm  parameter  is  what  makes  possible  more  than 
one  local  minimum  in  the  plots  of  versus  the  algorithm 
parameter. 

For  all  of  the  algorithms,  keeping  h  or  e  fixed  for  all  itera¬ 
tions  is  not  the  best  possible  strategy,  particularly  for  the  gradi¬ 
ent  methods.  At  the  point  at  which  the  error-reduction  algorithm  is 
converging  slowly,  the  gradient  is  small,  and  one  must  then  use  a 
large  value  of  h  in  order  to  make  rapid  progress.  However,  after 
using  a  large  value  of  h  for  a  few  iterations,  one  moves  to  a  point 
where  the  gradient  is  much  larger.  Then  one  should  use  a  smaller 
value  of  h  in  order  to  avoid  algorithm  instability.  If  a  method  for 
adaptively  choosing  h  at  each  iteration  were  devised,  then  one  would 
expect  the  gradient  methods  to  perform  considerably  better  than  the 
results  shown  here  using  a  fixed  value  of  h. 

Examples  of  an  alternative  to  using  a  fixed  value  of  h  and  a  are 
shown  in  Figure  7.  For  the  first  ten  iterations  of  each  algorithm, 
the  indicated  value  of  h  or  a  was  used  for  iterations  k  *  1,  3,  5, 

7,  and  9,  and  the  error-reduction  algorithm  (h  *  1  for  the  steepest 
descent  and  conjugate  gradient)  was  used  for  iterations  k  =  2,  4,  6, 

8,  and  10.  The  iterations  using  the  error-reduction  algorithm  help 
to  stabilize  the  algorithm  by  moving  toward  a  point  where  the  gradi¬ 
ent  is  smaller.  Comparing  Figures  6  and  7,  it  is  seen  that  with  this 
alternative  strategy,  the  optimum  value  of  each  algorithm  parameter 
is  considerably  larger  than  the  optimum  value  when  the  parameter  is 
kept  constant.  At  the  optimum  value  of  the  algorithm  parameter,  the 
alternative  strategy  gave  better  results  (a  lower  value  of  E^  at 

the  end  of  the  sequence  of  19  iterations)  than  those  shown  in 
Figure  6  for  the  basic  input-output  and  for  the  output-output  algo¬ 
rithms;  the  two  strategies  were  comparable  for  the  steepest  descent 
method;  and  for  the  hybrid  input-output  algorithm,  the  alternative 
strategy  gave  poorer  results  than  those  shown  in  Figure  6. 
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Curve  E  in  '^igure  7  shows  the  result  using  the  algorithm  of 
Eq.  (40),  using  the  alternative  strategy  described  above.  This  and 
numerous  other  variations  on  these  algorithms  can  be  used  with  vary¬ 
ing  degrees  of  success. 

The  results  shown  in  Figures  6  and  7  were  for  a  particular  set 
of  Fourier  modulus  data  for  a  particular  stage  of  reconstruction, 
and  for  a  particular  number  of  iterations;  and  the  results  in  other 
circumstances  could  be  significantly  different.  At  the  optimum  val¬ 
ues  of  the  algorithm  parameter  in  each  case,  the  algorithm  parameter 
was  large  enough  to  make  the  algorithm  somewhat  unstable,  and  so 
substantially  different  results  could  be  obtained  if  relatively  small 
changes  in  starting  point,  algorithm  parameter,  or  number  of  itera¬ 
tions  were  made.  In  general,  slower  but  steadier  progress  is  made 
if  an  algorithm  parameter  is  used  that  is  somewhat  smaller  than  the 
optimum  according  to  Figures  6  and  7.  These  results  do  serve  to  show 
trends  that  can  be  expected  to  apply  in  a  wider  range  of  circum¬ 
stances.  Further  development  is  needed  to  determine  the  best 
approach  for  the  general  case.  As  of  this  writing,  the  most  success¬ 
ful  strategy  has  been  to  alternate  between  several  (10  to  30)  itera¬ 
tions  of  the  hybrid  input-output  algorithm  and  a  few  (5  to  10)  itera¬ 
tions  of  the  error-reduction  algorithm. 

I 

Figure  8  shows  E^  versus  the  number  of  iterations  past  the 
starting  point  for  the  hybrid  input-output  algorithm  with  a  »  1 
(curve  B)  and  for  the  error -reduction  algorithm  (curve  A,  repeated 
from  Figure  5).  Curve  B1  shows  the  results  for  the  set  of  nineteen 
iterations  described  above  (ten  iterations  of  the  hybrid  input-output 
algorithm  followed  by  nine  iterations  of  the  error-reduction  algo¬ 
rithm).  Curve  B2  shows  the  results  of  twenty  iterations  of  the 
hybrid  input-output  algorithm  followed  by  a  few  iterations  of  the 
error-reduction  algorithm.  The  instability  of  the  hybrid  input- 
output  algorithm  is  seen  in  Curve  B,  in  which  E^  increases  from 
0.071275  to  0.137707  during  the  first  four  iterations.  By  the  end 
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of  ten  iterations  decreases  to  0.091176,  still  worse  than  the 
starting  point,  although  the  appearance  of  the  image  is  improved  from 
the  starting  point.  At  the  eleventh  iteration  of  Curve  Bl,  the  first 
iteration  of  the  error-reduction  algorithm,  drops  sharply  to  a 
value  of  0.047244,  although  the  appearance  of  the  output  image 
changes  little  from  that  of  the  tenth  iteration.  If  after  the  tenth 
iteration  the  hybrid  input-output  iterations  are  continued  for  ten 
more  iterations  (Curve  B2),  then  E^  continues  to  decrease  to  well 
below  the  level  of  E^  for  the  error-reduction  algorithm  alone 
(Curve  A).  Similar  to  the  case  of  Bl,  after  the  twentieth  iteration 
of  the  hybrid  input-output  algorithm  (Curve  B2),  when  a  few  itera¬ 
tions  of  the  error-reduction  algorithm  were  performed,  again  E^ 
fell  rapidly  to  a  level  consistent  with  the  improved  image  quality 
that  was  present.  This  is  the  characteristic  of  the  input-output 
algorithms  that  mandated  the  use  of  a  few  iterations  of  the  error- 
reduction  algorithm  to  make  a  fair  comparison.  For  the  input-output 
algorithms,  the  image  quality  is  often  better  than  what  one  would 
infer  from  the  value  of  E^. 

7.  DIAMETER  CONSTRAINT,  STARTING  INPUT,  AND  STRIPES 

A  side  issue  which  is  pertinent  to  all  algorithms  is  how  one 
defines  the  diameter  of  the  object  for  the  purpose  of  applying  the 
diameter  constraint  in  the  object  domain.  For  the  problem  of  phase 
retrieval  from  a  single  intensity  measurement,  the  object  is  usually 
of  finite  extent  and  on  a  dark  (zero  value)  background.  Then  bounds 
on  the  support  of  the  object  (the  set  of  points  over  which  the  object 
is  nonzero)  can  be  determined  from  the  support  of  the  autocorrela¬ 
tion,  which,  being  the  inverse  Fourier  transform  of  the  square  of 
the  Fourier  modulus,  can  be  computed  from  the  given  data.  As  shown 

in  Reference  10,  for  extended  objects  the  support  of  the  autocorrela¬ 
tion  usually  does  not  uniquely  define  the  support  of  the  object. 

Nevertheless,  reasonably  tight  bounds  can  be  made  on  the  support  of 
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the  object.  Locator  sets^®  can  be  defined  that  contain  all  possi¬ 
ble  solutions.  One  can  therefore  define  a  region  (a  "mask")  outside 
of  which  the  output  image  is  constrained  to  be  zero.  That  is,  the 
set  Y  defined  in  Eq.  (10)  includes  all  points  outside  the  mask  for 
which  g|^(x)  ^  0.  One  need  not  use  this  information  in  order  for 
the  algorithm  to  converge,  but  it  is  desirable  to  do  so,  since  using 
this  additional  information  speeds  up  the  convergence  of  the 
algorithm. 

A  problem  often  occurs  with  the  diameter  constraint  even  though 
the  mask  region  is  correctly  defined.  If  the  partially  reconstructed 
image,  g|^(x),  is  not  centered  in  the  mask  region,  then  in  applying 
the  diameter  constraint,  one  might  be  inadvertently  trying  to  "chop 
off"  (truncate)  one  part  of  the  object,  which  usually  results  in 
stagnation  of  the  algorithm.  For  this  reason,  it  is  usually  advanta¬ 
geous  to  define  the  mask  as  being  somewhat  larger  than  the  extent  of 
the  object. 

We  have  found  a  good  strategy  for  choosing  the  mask,  as  follows. 
For  the  first  several  iterations,  define  a  smaller  mask  which  very 
tightly  constrains  the  object.  This  helps  to  speed  up  the  conver¬ 
gence  of  the  algorithm  initially,  but  slows  it  down  for  later  itera¬ 
tions  when  the  problem  mentioned  above  becomes  more  significant. 

Then  for  later  iterations,  use  a  larger  mask  which  insures  that  none 
of  the  solution  is  being  truncated  by  the  mask.  Logical  choices  for 
masks  are  any  of  the  locator  sets^®. 

Faster  convergence  can  be  expected  if  the  starting  input,  gi(x), 
is  closer  to  the  solution.  A  good  starting  input  is  formed  as  fol¬ 
lows.  Compute  the  autocorrelation  function,  and  then  demagnify  it 
by  a  factor  of  two  (save  only  every  other  pixel  in  both  dimensions). 
Then  threshold  the  demagnified  autocorrelation  at  a  value  which  is  a 
small  fraction  of  its  peak,  setting  it  equal  to  zero  wherever  it  is 
below  the  threshold  value.  Finally,  replace  each  value  above  the 
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threshold  with  a  sample  of  a  random  variable  uniformly  distributed 
between  zero  and  unity.  The  result  is  a  random  (unbiased)  starting 
input  having  approximately  the  same  size  and  shape  as  the  original 
object. 

A  curious  phenomenon  often  occurs  for  the  problem  of  phase  re¬ 
trieval  from  a  single  intensity  measurement.  The  phase  retrieval 
algorithm  often  stagnates  at  a  local  minimum  characterized  by  a  pat- 
tern  of  stripes  across  the  image  ’  .  In  most  cases,  the  stripes 

are  barely  noticeable  and  are  of  low  contrast,  superimposed  on  an 
otherwise  excellent  reconstructed  image.  In  some  cases,  the  stripes 
are  of  high  enough  contrast  to  be  objectionable,  although  they  still 
permit  the  object  to  be  recognized.  The  cause  of  this  phenomenon  is 
not  well  understood,  but  it  is  thought  that  it  is  an  algorithm  con- 
vergence  problem  rather  than  a  uniqueness  problem  (it  is  at  a 
local,  not  a  global,  minimum  of  E^).  A  method  of  avoiding  this 
phenomenon  is  presently  being  sought,  although  it  fortunately  is  not 
much  of  a  problem  in  most  cases. 

8.  IMAGE  RECONSTRUCTION  EXAMPLE 

An  example  of  a  computer  experiment  using  the  iterative  recon¬ 
struction  algorithm  for  the  problem  of  phase  retrieval  from  a  single 
intensity  measurement  is  shown  in  Figure  9.  In  this  example,  a  real¬ 
istic  simulation  was  performed  to  arrive  at  the  kind  of  noisy  Fourier 
modulus  data  that  would  be  provided  by  stellar  speckle  interferom¬ 
etry^®,  including  the  effects  of  atmospheric  turbulence  and  photon 

-  18 
noise  . 

An  undegraded  object,  a  digitized  photograph  of  a  satellite, 
shown  in  Figure  9(a),  was  convolved  with  156  different  point-spread 
functions  to  produce  156  different  blurred  images.  Each  of  the 
point-spread  functions  represented  a  different  realization  of  the 
effects  of  the  turbulent  atmosphere.  The  blurred  images  were  then 
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subjected  to  a  Poisson  noise  process  to  simulate  the  effects  of 

photon  noise.  For  this  example,  there  were  approximately  300,000 

photons  per  degraded  image  (or  on  the  order  of  100  photons  per  pixel 

over  the  extent  of  the  object),  which  is  realistic  for  objects  of 

this  type  when  imaged  through  a  telescope  of  diameter  1.2  meters. 

Two  of  the  resulting  156  degraded  images  are  shown  in  Figures  9(b) 

20 

and  9(c).  The  degraded  images  were  then  processed  by  Labeyrie's 

21 

method,  as  modified  by  Goodman  and  Belsher  .  The  estimate  of  the 

18 

modulus  of  the  Fourier  transform  of  the  object  is  given  by 

1/2 

(45) 


where  In,('J)  is  the  Fourier  transform  of  the  m^*^  degraded  image,  Np 
is  the  total  number  of  photons  detected  (it  is  subtracted  in  order 
to  compensate  for  a  certain  noise  bias  term  that  arises  in  the  power 
spectrum  due  to  photon  noise  ),  S  (u)  is  the  Fourier  transform  of 
the  m  point-spread  function  (to  provide  compensation  for  the  MTF 
of  the  speckle  interferometry  process)  and  the  weighting  factor  W(u) 
is  the  MTF  due  to  the  telescope  aperture.  In  practice,  the  denom¬ 
inator  of  this  expression  would  be  obtained  by  making  measurements 
on  a  reference  star  through  an  atmosphere  having  the  same  statistics 
as  that  which  blurred  the  images  or  by  using  a  model  of  the  effects 
of  atmospheric  turbulence.  W(u)  was  included  in  order  to  restore 
the  natural  MTF  due  to  the  telescope  aperture  which  was  removed  by 
the  denominator  of  the  equation  above.  Figure  9(d)  shows  the  result¬ 
ing  Fourier  modulus  estimate. 

The  object  was  reconstructed  (or  equivalently,  the  Fourier  phase 
was  retrieved)  using  the  hybrid  input-output  algorithm  alternately 
with  the  error-reduction  algorithm.  The  result,  shown  in  Figure  9(e), 


|F(u)|  »  W(u) 


M 

m=l 

m=M+l 


agrees  very  well  with  the  original  object  shown  in  Figure  9(a), 
despite  the  noise  present  in  the  Fourier  modulus  data.  Good  recon¬ 
structed  images  were  also  obtained  when  only  one  tenth  as  many 

18 

photons  were  assumed  to  be  available  . 

Figure  10  shows  versus  the  number  of  iterations  for  this 
reconstruction.  The  starting  input  used  was  the  randomized  demagni- 
fied  autocorrelation  described  in  the  previous  section,  using  a 
threshold  value  of  0.004  times  the  peak  of  the  autocorrelation.  For 
the  first  ten  iterations,  the  error-reduction  algorithm  was  used, 
and  the  mask  defining  the  diameter  constraint  was  chosen  to  be  the 
region  over  which  the  autocorrelation  function,  spatially  demagnified 
by  a  factor  of  two,  exceeded  0.004  of  its  maximum  value  (providing  a 
fairly  tight  diameter  constraint).  For  iterations  11  to  20,  the 
error-reduction  algorithm  wa'.  tsed,  and  the  mask  for  these  and  the 
remaining  iterations  was  ..nosen  to  be  a  square  of  length  64  pixels, 
which  is  larger  than  the  actual  object  extent  of  about  60  by  40 
pixels  (imbedded  in  an  array  of  128  by  128  pixels).  The  error  de¬ 
creased  suddenly  at  the  tenth  iteration  since  some  positive-valued 
points  that  were  inside  the  second  mask  but  outside  the  first  mask 
were  no  longer  counted  as  contributing  to  E^.  By  the  twentieth 
iteration,  the  error-reduction  algorithm  was  converging  very  slowly. 
For  iterations  21  to  60,  the  hybrid  input-output  algorithm,  with  s 
equal  to  one,  was  used.  At  first,  E^  increased  sharply  (although 
the  output  image  appeared  no  worse  than  at  iteration  20),  but  then 
decreased  fairly  rapidly  until  stagnating  at  E^  =  0.05  at  about 
iteration  55.  For  iterations  61  to  70,  the  error-reduction  algorithm 
was  used,  for  which  E^  dropped  suddenly  from  0.05  to  0.02,  although 
the  visual  appearance  of  the  reconstructed  image  remained  the  same 
as  for  iteration  number  60. 

This  final  value  of  E  is  comparable  to  0.03,  the  normalized 

^  18 

rms  error  of  the  Fourier  modulus  estimate  itself  .  It  is  impos¬ 
sible  to  reduce  E^  to  zero  since  the  noise  in  the  Fourier  modulus 


estimate  results  in  an  inconsistency  of  the  nonnegativity  constraint 
and  the  Fourier  modulus  estimate.  This  inconsistency  can  be  seen 
from  the  fact  that  the  autocorrelation  estimate  computed  from  the 
Fourier  modulus  estimate  has  areas  of  negative  value. 

Reconstruction  experiments  do  not  always  proceed  as  smoothly  as 
the  one  described  above.  When  stagnation  occurs  before  is  re¬ 
duced  to  a  level  consistent  with  the  error  in  the  Fourier  modulus 
data,  often  the  best  strategy  is  to  restart  the  algorithm  using  a 
different  set  of  random  numbers  as  the  initial  input.  One  must  also 
be  careful  in  the  way  one  chooses  the  mask  as  discussed  in  Section  7. 
Finally,  it  is  always  advisable  to  reconstruct  the  image  two  or  three 
times  from  the  same  Fourier  modulus  data  using  different  starting 
inputs  each  time.  If  the  same  image  is  reconstructed  each  time,  then 
one  would  have  confidence  that  the  solution  is  unique. 

9.  COMMENTS  AND  CONCLUSIONS 

For  the  problem  of  phase  retrieval  from  a  single  intensity  mea¬ 
surement,  it  has  been  shown  that  the  error-reduction  algorithm  (the 
Gerchberg-Saxton  algorithm  applied  to  this  problem)  is  equivalent  to 
the  steepest  descent  method  with  a  double-length  step.  Furthermore, 
it  was  shown  that  the  error-reduction  algorithm  converges  in  the 
sense  that  the  error  monotonically  decreases.  However,  in  practice 
the  error-reduction  algorithm  converges  very  slowly;  and  several 
other  algorithms,  including  other  gradient  search  algorithms  which 
utilize  the  method  of  Fourier  transforms  for  rapidly  computing  the 
gradient  of  the  error,  and  the  input-output  family  of  iterative 
Fourier  transform  algorithms,  converge  much  faster.  Of  the  algo¬ 
rithms  investigated  so  far,  the  hybrid  input-output  algorithm  has 
converged  the  fastest.  Nevertheless,  the  gradient  search  algorithms 
also  converge  reasonably  fast,  and  they  deserve  further  development. 
The  performance  of  any  given  algorithm  can  vary  depending  on  the 
stage  of  reconstruction  and  on  what  is  being  reconstructed.  Although 
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a  practical  phase  retrieval  algorithm  is  in  hand,  there  is  still  an 
element  of  trial-and-error  involved. 

For  the  problem  of  a  phase  retrieval  from  two  intensity  measure¬ 
ments,  the  steepest  descent  method  and  the  Gerchberg-Saxton  algorithm 
were  compared  and  were  found  to  share  some  similarities.  When  manip¬ 
ulating  just  the  object's  phase,  the  object-domain  constraint,  that 
the  object's  modulus  equal  the  measured  modulus,  is  automatically 
satisfied.  Therefore,  the  gradient  search  methods  seem  to  be  inher¬ 
ently  better  suited  to  the  problem  of  two  intensity  measurements  than 
to  the  problem  of  a  single  intensity  measurement. 

Not  considered  here  has  been  the  question  of  the  uniqueness  of 

solutions  to  either  of  the  two  phase  retrieval  problems.  There  has 

been  considerable  controversy  surrounding  this  question,  both  for 

2  22  23 

the  problem  of  two  intensity  measuranents  ’  ’  and  for  the  problem 

of  a  single  intensity  measurement  .  For  the  latter  problem,  the 
experimental  reconstruction  results  shown  here  and  elsewhere^*^*^®*^® 
suggest  that  for  complicated  two-dimensional  objects,  the  solution 
is  usually  unique  in  practice,  even  when  the  Fourier  modulus  data  is 

ig 

corrupted  by  a  considerable  amount  of  noise  . 

This  research  was  supported  by  the  Air  Force  Office  of  Scientific 
Research  under  Contract  No.  F49620-80-C-0006. 
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APPENDIX  A 


In  this  appendix,  the  relationship  between  the  steepest  descent 
method  for  the  problem  of  two  intensity  measurements  and  the 
Gerchberg-Saxton  algorithm  is  explored. 

For  the  problem  of  two  intensity  measurements,  the  error  B  is 
minimized  with  respect  to  the  phase  estimates  &(x).  The  partial 
derivative  of  B  with  respect  to  the  value  of  a  given  phase,  &(x),  is 


Since 


3,B  = 


3B 

ad(x) 


2N-2  2[|e(u)|  -  |F(u)|] 
u 


3  |G(u)  I 
ad(x}  ■ 


(A-1) 


a&{^  *  a&brrS  [-i2TTU-y/N] 

y 

=  i|f(x)|  exp  [i®(x)]  exp  [-i2iru*x/N] 

«  ig(x)  exp  [-i2iru.x/N]  ,  (A-2) 

then 

a  IG(u)  I  G(u)(-i)g*(x)  exp  [i2Tru»x/N]  c.c. 
a©(x)  *  2!G{u)1 

where  c.c.  indicates  the  complex  conjugate  of  what  precedes  it. 

2 

Using  Eq.  (27),  Eq.  (A-1)  becomes 

a^B  =  ig*(x)[g'(x)  -  g(x)]  +  c.c. 

=  ig*(x)g'(x)  +  c.c,  =  -2  Im  [g*(x)g'(x)] 

=  -2  |f(x)  1  Ig'(x)  I  sin  [e'(x)  -  o(x)]  (A-4) 

where  o'(x)  is  the  ph-ase  of  g'(x).  Therefore,  the  gradient  can  be 
easily  computed  using  two  Fourier  transforms  (the  first  three  steps 
of  the  Gerchberg-Saxton  algorithm)  plus  evaluating  Eq.  (A-4).  Analo- 
-gous  to  the  linear  approximation  of  Eq.  (29)  is 


®  ~  2  ®  (A-5) 

X 

where  is  a^B  evaluated  at  &(x)  =  ©|^(x),  the  phase  of  gj^(x). 

This  linear  expansion  of  B  is  equal  to  zero  for 


9(x)  -  \M - .  (A-6) 

y 

which  can  be  easily  verified  by  inserting  Eq.  (A-6)  into  Eq.  (A-5). 

By  Eq.  (A-4), 


2  “  ^2  l^(y)|^|9k(y)|^  sin^  [»,^(y)  -  9i((y)l  .  (A-7) 

y  y 

and  so  Eq.  (A-6)  becomes 

Bu  |f(x)gMx)  I  siri  [®Mx)  -  9,.(x)] 

9(X)  -  9,^(X)  =  - ^ 5 ^ K ^ 

2  2  K(y)  1^  l9k(y)  1^  sin^  [9,^(y)  -  \(y)] 
y 

Now  consider  the  situation  after  several  iterations  have  been  per¬ 
formed,  and  the  error  is  decreasing  slowly.  Then  for  most  values  of 

/ 

sin  [©,^(x)  -  ©,^(x)]  =  o^(x)  -  ©,^(x)  .  (A-9) 

Then  from  Eq.  (A-8),  it  is  seen  that  o(x)  -  ®|^(x),  which  is  the 
change  of  9|^(x)  indicated  by  the  steepest  descent  method,  is  approx¬ 
imately  proportional  to  0||(x)  -  0|^(x).  Recall  from  Eq.  (9)  that 
&l|(x)  is  new  phase  that  would  be  given  by  the  Gerchberg-Saxton 
algorithm.  Thus,  for  the  problem  of  two  intensity  measurements,  the 
steepest  descent  method  gives  a  new  phase  that  is  going  in  the  same 
general  direction  as  the  Gerchberg-Saxton  algorithm,  but  uses  a 
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different  step  size.  For  the  steepest  descent  method,  the  direction 
differs  somewhat  from  ®']^(x)  -  &|^{x)  due  to  the  weighting  factor 
[^(x)g|^(x)[  which  varies  with  x. 

In  order  to  obtain  some  notion  of  the  step  size  for  the  steepest 
descent  method,  continue  to  consider  the  situation  where  the  sum  is 
decreasing  and  Eg.  (A-9)  applies.  Then  using  Eq.  (16),  one  can 
approximate 

Ci»k(x)]  -  |9k(x)|  exp 

X 

“  -  21f(x)gj^(x)  I  cos  C©,^(x)  -  ©|^(x)]J 

X 

=  2^  C  H'(X)|  -  |g,|(x)|]^  +  ^  |f(x)g,^(x)  I  sin^  [&,^(x)  -  &,^(x)] 

^  ^  (A-10) 

In  order  to  further  simplify  things,  consider  the  special  case  for 
which  f(x)  »  fjj,  a  constant  independent  of  x.  Furthermore,  if  B 
is  small,  then  for  most  values  of  x, 

I  l^(x)  I  -  |g|^(x)  !|  «  |f(x)  I  .  (A-11) 

Under  these  circumstances,  inserting  Eqs.  (A-9)  and  (A-10)  into  (A-8) 
yields 

Br" 

®kj 

where 

®k  -  |g,;(x)|]^  (A-13) 

X 
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and 


®k  “  Z  l9k(x)l^Cs,;(x)  -  (A-14) 

X 

are  the  respective  "radial"  and  "tangential"  components  of  the  error 
=  bJ^  +  bJ^.  The  radial  component,  B*^,  is  the  object-domain 
squared  error  of  the  modulus  of  |gj^(x)j  and  is  equal  to  Eqj^ 
according  to  Eq.  (14).  When  the  error  is  decreasing  slowly,  then 

=•  \  -  4  -  4  «  4  -  (''-’5) 

Therefore,  under  these  circumstances,  the  coefficient  of  Eq.  (A-12), 

1 

I 

That  is,  when  the  Gerchberg-Saxton  algorithm  is  converging  slowly, 
the  steepest  descent  method  indicates  that  a  much  larger  step  size 
be  used.  Note  that  if  one  uses  the  double-length  step  steepest 
descent  method,  then  the  factor  of  1/2  in  Eqs.  (A-8),  (A-12)  and 
(A-16)  is  replaced  by  unity. 


1  t 


»  1 


(A-16) 
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FIGURE  CAPTIONS 


1.  Block  diagram  of  the  error-reduction  (Gerchberg-Saxton) 
algorithm. 

2.  RMS  error  versus  the  number  of  iterations  for  the  problem  of 
phase  retrieval  from  a  single  intensity  measurement  using  the 
error-reduction  algorithm. 

3.  Block  diagram  for  the  gradient  search  methods,  using  the  method 
of  Fourier  transforms  to  compute  the  gradient. 

4.  Block  diagram  of  the  system  for  the  input-output  concept. 

5.  RMS  error  versus  the  number  of  iterations  using  the  error- 
reduction  algorithm. 

6.  RMS  error  after  a  fixed  number  of  iterations  versus  the 
algorithm. parameter.  Curve  A:  steepest  descent  method  (o); 

B:  conjugate  gradient  method  {•);  C:  basic  input-output 
algorithm  (A);  0:  output -output  algorithm  (A);  E:  hybrid 
input-output  algorithm  (□).  The  result  using  the  error- 
reduction  algorithm  is  indicated  by  a  large  circle. 

7.  RMS  error  after  a  fixed  number  of  iterations  (using  the 
alternative  strategy)  versus  the  algorithm  parameter.  Curve  A: 
steepest-descent  method  (o);  B:  basic  input-output  algorithm 
(•);  C:  output-output  algorithm  (A);  D:  ftybrid  input-output 
algorithm  (A);  E:  the  algorithm  of  Eq.  (40)  (a). 

8.  RMS  error  versus  the  number  of  iterations  for  the 
error-reduction  algorithm  (Curve  A),  and  for  the  hybrid 
input-output  algorithm  (Curve  B  —  see  text). 

9.  Image  reconstruction  experiments,  (a)  Undejgraded  object;  (b), 
(c)  examples  of  degraded  images  simulated  to  include  the  effects 
of  atmospheric  turbulence  and  photon  noise;  (d)  Fourier  modulus 
estimate  computed  from  the  degraded  images;  (e)  image 
reconstructed  using  the  iterative  algorithm. 

10.  RMS  error  versus  the  number  of  iterations. 
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Block  diagram  of  the  error-reduction  (Gerchberg-Saxton ) 
algorithm. 
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Figure  2.  RMS  error  versus  the  number  of  Iterations  for  the  problem  of 
phase  retrieval  from  a  single  intensity  measurement  using  the 
error-reduction  algorithm. 
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Figure  3.  Block  diagram  for  the  gradient  search  methods,  using  the  method 
of  Fourier  transforms  to  compute  the  gradient. 
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Figure  4.  Block  diagram  of  the  system  for  the  input-output  concept 
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Figure  5.  RMS  error  versus  the  number  of  Iterations  using  the  error- 
reduction  algorithm. 
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Figure  6.  RMS  error  after  a  fixed  number  of  iterations  versus  the 

algorithm. parameter.  Curve  A;  steepest  descent  method  (o); 
B:  conjugate  gradient  method  (•);  C:  basic  input-output 
algorithm  (A);  0:  output-output  algorithm  (A);  E:  hybrid 
input-output  algorithm  (□).  The  result  using  the  error- 
reduction  algorithm  is  indicated  by  a  large  circle. 
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Figure  7.  RMS  error  after  a  fixed  number  of  iterations  (using  the 

alternative  strategy)  versus  the  algorithm  parameter.  Curve  A 
steepest-descent  method  (o);  B:  basic  input-output  algorithm 
(•);  C:  output-output  algorithm  (A);  0:  hybrid  input-output 
algorithm  (A);  E:  the  algorithm  of  Eq.  (40)  (o). 
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Figure  8.  RMS  error  versus  the.  nurnber  of  iterations  for  the 

error-reduction  algorithm  (Curve  A),  and  for  the  hybrid 
input-output  algorithm  (Curve  B  —  see  text). 


Figure  9,  Image  reconstruction  experiments,  (a)  Undegraded  object;  (b), 

(c)  examples  of  degraded  images  simulated  to  include  the  effects 
of  atmospheric  turbulence  and  photon  noise;  (d)  Fourier  modulus 
estimate  comouted  from  the  degraded  images;  (e)  image 
reconstructed  using  the  iterative  algorithm. 
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Figure  10.  RMS  error  versus  the  number  of  iterations 
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ABSTRACT 

This  paper  reviews  the  Gerchberg-Saxton  algorithm  and  variations 
thereof  that  have  been  used  to  solve  a  number  of  difficult  recon¬ 
struction  and  synthesis  problems  in  optics  and  related  fields.  It 
can  be  used  on  any  problem  in  which  only  partial  information  (includ¬ 
ing  both  measurements  and  constraints)  of  the  wavefront  or  signal  is 
available  in  one  domain  and  other  partial  information  is  available 
in  another  domain  (usually  the  Fourier  domain).  The  algorithm  com¬ 
bines  the  information  in  both  domains  to  arrive  at  the  complete 
description  of  the  wavefront  or  signal.  Various  applications  are 
reviewed,  including  synthesis  of  Fourier  transform  pairs  having 
desirable  properties  as  well  as  reconstruction  problems.  Variations 
of  the  algorithm  and  the  convergence  properties  of  the  algorithm  are 
discussed. 
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1 .  INTRODUCTION 


There  exist  many  problems  that  are  very  difficult  to  solve  in 
astronomy,  X-ray  crystallography,  electron  microscopy,  spectroscopy, 
wavefront  sensing,  holography,  particle  scattering,  superresolution, 
radar  signal  and  antenna  synthesis,  filter  design,  and  other  disci-, 
plines  which  share  an  important  feature.  These  are  problems  which 
involve  the  reconstruction  or  synthesis  of  a  wavefront  (or  an  object 
or  a  signal,  etc.)  when  partial  information  or  constraints  exists  in 
each  of  two  different  domains.  The  second  domain  is  usually  the 
Fourier  transform  domain.  This  paper  describes  a  method  of  combining 
all  the  available  information  in  the  two  domains  to  arrive  at  a  com¬ 
plete  description  and  thereby  solving  the  problems. 

The  problems  fall  into  two  general  categories:  (1)  reconstruct 
the  entire  information  about  a  function  (an  image,  wavefront,  signal, 
etc.)  when  only  partial  information  is  available  in  each  of  two 
domains;  and  (2)  synthesize  a  (Fourier)  transform  pair  having  desir¬ 
able  properties  in  both  domains.  A  reconstruction  problem  arises 
when  only  partial  information  is  measured  in  one  domain,  and  in  the 
other  domain,  either  partial  information  is  measured  or  certain  con¬ 
straints  are  known  a  priori.  The  information  available  in  any  one 
domain  is  insufficient  to  reconstruct  the  function  or  its  transform. 

A  synthesis  problem  typically  arises  when  one  wants  the  transform  of 
a  function  to  have  certain  desirable  properties  (such  as  uniform 
spectrum,  low  sidelobes,  etc.)  while  the  function  itself  must  satisfy 
certain  constraints  or  have  certain  desirable  properties.  Because 
arbitrary  sets  of  properties  and  constraints  can  be  contradictory, 
there  may  not  exist  a  transform  pair  that  is  completely  desirable 
and  satisfies  all  the  constraints.  Nevertheless,  one  seeks  a  trans¬ 
form  pair  that  comes  as  close  as  possible  to  having  the  desirable 
properties  and  satisfying  the  constraints  in  both  domains. 

Both  the  reconstruction  and  the  synthesis  problems  can  be  ex¬ 
pressed  as  follows,  if  the  meaning  of  the  word  "constraints"  is 


broadened  to  include  any  kind  of  measured  data,  desirable  properties, 
or  a  priori  conditions: 

Given  a  set  of  constraints  placed  on  a  function 
and  another  set  of  constraints  placed  on  its 
transform,  find  a  transform  pair  (i.e.,  a  function 
and  its  transform)  that  satisfies  both  sets  of 
constraints. 

Once  a  solution  is  found  to  such  a  problem,  the  question  often 
remains:  is  the  solution  unique?  For  synthesis  problems,  the 
uniqueness  is  usually  unimportant  —  one  is  satisfied  with  any  solu¬ 
tion  that  satisfies  all  the  constraints;  often  a  more  important 
problem  is  whether  there  exists  any  solution  that  satisfies  what  may 
be  arbitrary  and  conflicting  constraints.  For  reconstruction  prob¬ 
lems,  the  uniqueness  properties  of  the  solution  are  of  central  impor¬ 
tance.  If  many  different  functions  satisfying  the  constraints  could 
give  rise  to  the  same  measured  data,  then  a  solution  that  is  found 
could  not  be  guaranteed  to  be  the  correct  solution.  The  question  of 
uniqueness  must  be  studied  for  each  problem.  Fortunately,  as  will 
be  described  later,  for  some  important  reconstruction  problems  the 
solution  usually  is  unique. 

An  effective  approach  to  solving  the  large  class  of  problems 
described  above  is  the  use  of  iterative  algorithms  related  to  the 
Gerchberg-Saxton  algorithm^.  The  algorithms  involve  the  iterative 
transformation  back  and  forth  between  the  two  domains,  repetitively 
applying  the  known  constraints  in  each  domain. 

The  basic  algorithm  is  presented  in  Section  2.  A  number  of  dif¬ 
ferent  applications  having  different  types  of  constraints  are 
described,  and  examples  are  shown  in  Section  3.  In  Section  4  the 
convergence  properties  of  the  algorithm  are  discussed,  and  improved 
versions  of  the  algorithm  are  reviewed.  A  brief  summary  and  comments 
are  included  in  Section  5. 
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2.  THE  BASIC  ITERATIVE  ALGORITHM 

The  first  published  account  of  the  iterative  algorithm  was  its 

use  by  Gerchberg  and  Saxton^  to  solve  the  electron  microscopy 

problem  in  which  both  the  modulus  of  a  complex-valued  image  and  the 

modulus  of  its  Fourier  transform  are  measured,  and  the  goal  is  to 

reconstruct  the  phase  in  both  domains.  Apparently  unknown  to  them, 

2 

it  was  invented  somewhat  earlier  by  Hirsch,  Jordan,  and  Lesem  to 
solve  a  synthesis  problem  for  computer-generated  holograms  which  has 
a  similar  set  of  constraints.  (This  will  be  described  later  in  more 
detail.)  The  method  was  again  reinvented  for  a  similar  problem  in 
computer  holography  by  Gallagher  and  Liu^.  The  fact  that  the 
algorithm  was  invented  repeatedly  testifies  to  its  simplicity  and 
effectiveness. 

2.1  The  Gerchberq-Saxton  Algorithm 

In  what  immediately  follows,  the  iterative  algorithm  is  described 
in  terms  of  its  application  to  the  electron  microscopy  reconstruction 
problem.  An  excellent  treatment  of  the  electron  microscopy  phase 
problem  and  its  solution  by  this  and  other  methods  can  be  found  in 
Reference  4.  Later  it  is  shown  how  to  apply  the  same  principles  to 
a  large  class  of  problems. 

Suppose  that  the  electron  wave  function  in  an  image  plane  is 
described  by  the  two-dimensional  complex-valued  function 

f(x)  .  lf(x)|  (1) 

Its  Fourier  transform,  the  wave  function  in  a  far-field  diffraction 
plane,  is  given  by 

F(u)  -  |F(u)|  -  g'LfCx)]  -  /"f(x)  dx  (2) 

where  x  and  u  are  the  vector  coordinates  in  the  spatial  (image) 
domain  and  the  spatial  frequency  (far-field  diffraction)  domain. 
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respectively.  The  notation  used  throughout  this  paper  is  that  func¬ 
tions  represented  by  capital  letters  are  the  Fourier  transforms  of 
the  functions  represented  by  the  corresponding  lower-case  letters. 

It  is  assumed  that  the  intensity  spatial  distributions  are  measured 
in  each  domain,  but  the  phase  information  is  lost.  Therefore  one 
wishes  to  reconstruct  i|)(x)  and  ©(x)  from  |f(x)|  and  |F(u)|. 

The  iterative  algorithm  for  solving  this  problem  is  depicted  in 
Figure  1.  One  iteration  (the  iteration)  of  the  algorithm  pro¬ 
ceeds  as  follows.  A  trial  solution  for  the  wave  function  (an  esti¬ 
mate  of  the  wave  function),  9|^(x),  is  Fourier  transformed  yielding 

G,^(u)  -  lG,^(u)l  exp  [i^,^(u)]  *  9^[g,^(x)]  (3) 

Then  a  new  Fourier-domain  function,  6j^(u),  is  formed  by  replacing 
the  computed  Fourier  modulus  by  the  measured  Fourier  modulus,  |F(u)|, 
and  keeping  the  computed  phase: 

G,^(u)  -  |F(u)|  exp  [i^,^(u)]  (4) 

The  resulting  6j|,(u),  which  is  in  agreement  with  all  the  known  mea¬ 
surements  and  constraints  in  the  Fourier  domain,  is  inverse  Fourier 
transformed  yielding  the  wave  function  gj^(x).  Then  the  iteration  is 
completed  by  forming  a  new  estimate  for  the  wave  function,  g|^+i(x), 
which  is  obtained  by  replacing  the  computed  modulus  of  g|^(x)  with  the 
measured  modulus  if(x)|,  and  keeping  the  computed  phase. 

The  algorithm  consists  of  no  more  than  enforcing  what  information 
is  available  on  the  wave  function,  Fourier  transforming,  imposing 
what  information  is  available  on  the  wave  function's  Fourier  trans¬ 
form,  inverse  transforming;  and  repeating  these  simple  operations 
for  a  number  of  iterations.  What  makes  the  algorithm  practical  is 

5 

the  existence  of  a  fast  Fourier  transform  (FFT),  so  that  the  num¬ 
ber  of  computations  per  iteration  goes  only  as  N  log  N,  where  N  is 
the  number  of  samples  of  the  function  computed.  This  compares  very 

4 

favorably  with  some  other  iterative  methods,  such  as  Newton-Raphson  , 

.  3 

for  which  the  number  of  computations  per  iteration  goes  as  N  . 
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A  measure  of  the  progress  of  the  iterations,  and  a  criterion  by 
which  one  can  determine  when  a  solution  has  been  found,  is  the  nor¬ 
malized  mean-squared  error,  which  is  defined  in  the  Fourier  domain  by 


00 


/[|G|^(u)|  -  |F(u)|]2  du 


or  in  the  image  domain  by 


E 


2 

0 


00 


I  [|g,;(x)i  -  \f{x)\f  dx 


/  lf(x)|^  dx 
^—00 


(5) 


(6) 


It  has  been  sh(,wh  that  the  algorithm  converges  in  the  sense  that  the 
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mean-squared  error  can  only  decrease  at  each  iteration.  *  *  The 
issue  of  convergence  will  be  discussed  in  greater  detail  in 
Section  4. 


2.2  The  Error-Reduction  Iterative  Algorithm 

It  is  now  known  that  with  slight  modifications  this  same  algo¬ 
rithm  can  be  applied  to  many  different  problems  having  a  variety  of 
available  constraints  or  measurements^.  Let  the  function  f(x) 
represent  a  .wave  front,  an  object,  a  signal,  an  antenna  array,  a 
spectral  density  function,  an  electron  density  function,  etc.,  where 
X  is  an  N-dimensional  vector  (spatial,  angular,  time,  etc.)  coordi¬ 
nate.  Depending  on  the  problem,  f(x)  may  be  complex  valued  or  real 
valued,  and,  if  real,  may  or  may  not  be  nonnegative.  Its  Fourier 
transform,  F(u),  is  given  by  Eq.  (2),  and  is  complex  valued  for  most 
problems.  The  N-dimensional  vector  u  is  a  (spatial,  angular,  time, 
etc.)  frequency  coordinate.  One  can  instead  consider  another  trans¬ 
formation  of  f(x),  such  as  the  Fresnel  transform,  which  has  been  done 
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for  more  than  one  problem.  ’  ’  For  simplicity  of  discussion,  the 


Fourier  transform  will  be  assumed,  but  the  reader  should  keep  in  mind 
that  what  is  said  also  applies  to  a  number  of  other  transformations 
as  well  (although  the  method  becomes  less  attractive  if  a  fast  trans¬ 
form  algorithm  is  not  available).  ^ 

With  only  slight  modifications,  the  Gerchberg-Saxton  algorithm 
can  be  used  to  solve  the  wide  class  of  problems  described  in  Sec¬ 
tion  1.  Referring  again  to  the  block  diagram  of  the  algorithm  in 
Figure  1,  all  that  is  required  is  to  impose  constraints  in  each 
domain  that  are  pertinent  to  the  problem  of  interest.  Just  as  for 
the  electron  microscopy  problem,  at  the  k^^  iteration,  g|^(x),  an 
estimate  of  f(x),  is  Fourier  transformed,  yielding  G|^(u)  given  by 
Eq.  (3).  Then  a  new  Fourier-domain  function  G|^(u)  is  formed  from 
6|^(u)  by  making  the  smallest  possible  changes  in  G|^(u)  that  allow  it 
to  satisfy  the  Fourier-domain  constraints.  For  example,  if  the 
Fourier  domain  constraint  is  that  the  Fourier  modulus  equals  |F(u)| 
over  some  region  of  the  Fourier  domain,  then  |F(u)|  is  substituted 
for  l6|^(u)|  in  that  region  of  the  Fourier  domain.  The  new  Fourier- 
domain  function  G|^(u),  which  satisfies  the  Fourier-domain  con¬ 
straints,  is  inverse  Fourier  transformed  to  yield  g|^(x).  To  complete 
one  iteration,  a  new  estimate  g|^+i(x)  is  formed  from  g|^(x)  by  making 
the  smallest  possible  changes  in  g^,(x)  that  allow  it  to  satisfy  the 
function-domain  constraints.  One  example  is  that  if  the  function  is 
complex  valued  and  it  is  constrained  to  have  a  modulus  equal  to 
lf(x)|  over  some  region  of  space,  then  |f(x)|is  substituted  for 
lg|<.(x)l  in  that  region.  A  special  case  of  this  is  when  the  func¬ 
tion  is  to  be  zero  outside  a  certain  interval  (then  the  Fourier  func¬ 
tion  is  bandlimited).  Another  example  is  that  if  the  function  is 
constrained  to  be  nonnegative,  then  g|^+i(x)  is  set  equal  to  g|[(x) 
for  those  x  where  g|^(x)  >  0,  and  g|^+i(x)  is  set  equal  to  zero  for 
those  X  where  g|^(x)  <  0.  In  summary,  one  transforms  back  and  forth 
between  the  two  domains,  forcing  the  function  to  satisfy  the  con¬ 
straints  in  each  domain. 
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For  reconstruction  problems,  whatever  characteristics  of  the 
actual  F(u)  and  f(x)  that  are  measured  or  are  known  a  priori  are 
imposed  on  G|^(u)  and  gj[(x),  respectively.  For  synthesis  problems, 
one  imposes  on  G|^{u)  and  g|J(x)  whatever  characteristics  one  might 
desire  F(u)  and  f{x),  respectively,  to  have.  Once  the  constraints 
are  defined,  the  algorithm  proceeds  the  same  for  synthesis  problems 
as  for  reconstruction  problems.  In  fact,  there  are  some  synthesis 
problems  that  are  mathematically  indistinguishable  from  some  recon¬ 
struction  problems,  and  are  handled  identically  by  the  algorithm. 


The  first  iteration  of  the  algorithm  can  be  started  in  a  number 
of  ways,  for  example,  by  setting  g-jCx)  or  #i(x)  equal  to  an  array  of 
random  numbers.  The  iterations  continue  until  a  Fourier  transform 
pair  is  found  that  satisfies  all  the  constraints  in  both  domains  to 
within  the  desired  accuracy  (or,  if  convergence  is  too  slow,  until 
one  loses  interest  or  the  money  runs  out).  The  mean-squared  error 
can  generally  be  defined  in  the  Fourier  domain  by 


j"lG^(u)  -  6'(u)l^  du 


/’|G,;(u)|' 


(7) 


du 


or  in  the  function  domain  by 


«o  ^ 

J  l9|,ti(x)  -  9i;(x)r  ‘‘x 


/■|9j(x)r 


(8) 


dx 


In  each  of  these  two  expressions,  the  integrand  in  the  numerator  is 
the  squared  modulus  of  the  amount  by  which  the  computed  function 
violates  the  constraints  in  that  domain.  It  is  easily  seen  that 
these  expressions  reduce  to  Eqs.  (5)  and  (6),  respectively,  for  the 
electron  microscopy  problem. 
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Just  as  in  the  electron  microscopy  problem,  for  problems  having 
other  sets  of  constraints  it  will  be  shown  in  Section  4  that  the 
algorithm  converges,  that  is,  the  error  decreases  at  each  successive 
iteration.  The  algorithm  depicted  in  Figure  1  may  be  referred  to  as 
the  "error  reduction"  algorithm  for  that  reason,  as  well  as  to  dis¬ 
tinguish  it  from  algo’^ithms  described  in  Section  4  which  are  related 
to  it  but  converge  faster.  Typically,  the  error  is  reduced  very 
rapidly  for  the  first  few  iterations  of  the  error-reduction  algo¬ 
rithm,  but  more  slowly  for  later  iterations.  For  some  applications, 
the  error-reduction  algorithm  has  been  very  successful  in  finding 
solutions  using  a  reasonable  number  of  iterations.  However,  for  some 
other  applications,  the  mean-squared  error  decreases  very  slowly  with 
each  iteration,  requiring  an  impractical ly  large  number  of  iterations 
for  convergence.  The  improved  algorithms  described  in  Section  4  do 
much  to  alleviate  this  problem. 


2.3  Alternative  Descriptions  of  the  Algorithm 


Once  a  solution  (i.e.,  a  Fourier  transform  pair  satisfying  all 
the  constraints  in  both  domains)  is  found,  the  error-reduction  algo¬ 
rithm  ceases  to  make  changes  to  the  estimate,  and  the  algorithm  locks 
onto  the  solution.  The  operations  of  enforcing  the  constraints  in 
each  domain  would  then  leave  the  function  estimate  and  its  Fourier 
transform  unaltered  since  they  already  satisfy  the  constraints.  Now 
let  us  define  the  operation  S[g(x)]  as  the  successive  Fourier  trans¬ 
formation  of  g(x),  followed  by  the  imposition  of  the  Fourier  domain 
constraints,  followed  by  inverse  Fourier  transformation,  followed  by 
imposition  of  the  object  domain  constraints.  That  is,  the  operation 
S  is  just  the  performance  of  one  iteration  of  the  error-reduction 
approach,  and 


g,^+l(x)  =  S[g,^(x)]. 


(9) 


From  the  discussion  above,  it  is  evident  that  any  solution  f(x)  must 
satisfy  the  relation 
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f(x)  -  S[f{x)]. 


(10) 


When  presented  in  this  form,  it  is  seen  that  the  error-reduction 
approach  is  a  particular  implementation  of  the  method  of  successive 
approximations.^® 

The  method  of  successive  approximations  can  be  more  easily  under¬ 
stood  from  the  following  simple  example.  Suppose  one  wishes  to  solve 
the  following  equation  for  y: 

4y^  _  4y  +  1  -  0  (11) 

4 

Based  on  the  relation  y  »  y  +1/4,  one  could  write 

Vl  -S,(y,)  -yj  -  1/4  (12) 

Using  the  method  of  successive  approximations  to  find  the  solution, 
one  would  pick  an  initial  estimate,  say  y^  «  0.1,  and,  employing 
Eq.  (12)  compute  y^  *  0.2501,  «  0.2539,  etc.,  and  rapidly  converge 

to  the  solution  y'  «  0.2541737...  .  However,  it  converges  to  y'  only 
for  y^  <  y"  «  0.8967902...  .  For  y^  >  y",  Eq.  (12)  diverges,  and  for 
y^  c:  y",  it  stays  at  y",  the  second  solution.  On  the  other  hand,  one 
could  just  as  logically  have  chosen 

A»1  -  ^2(^k>  •  ‘4  - 

This  second  form  converges  to  the  second  solution  y"  for  y^  >  y', 
diverges  for  y^  <  y',  an  stays  at  y'  for  y^  ■  y'.  Figure  2,  a  graph¬ 
ical  representation  of  Eq.  (12),  shows  the  two  solutions,  y'  and  y". 

4 

The  irregular  staircase  between  the  two  curves  y  and  y  +1/4  indi¬ 
cates  how  the  estimate  yj^  approaches  the  two  solutions.  Criteria 
on  the  derivative  of  S(y)  determine  whether  the  algorithm 
converges. 

The  error-reduction  algorithm,  as  described  by  Eqs.  (9)  and  (10), 
is  analogous  to  the  example  of  successive  approximations  described 
above,  except  that  instead  of  operating  on  a  scalar  y,  it  operates 
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on  a  function  g(x).  As  seen  from  the  example,  the  method  of  succes¬ 
sive  approximations  may  or  may  not  converge,  depending  on  the  partic¬ 
ular  form  chosen  and  on  the  initial  estimate.  Fortunately,  as  will 
be  discussed  further  in  Section  4,  the  error-reduction  approach  never 
diverges.  It  may,  however,  stagnate.  A  simple  example  of  stagnation 
is  shown  by  the  following.  In  solving  x  =  2  -  x  (which  has  the 
obvious  solution  x  *  1 )  by  the  method  of  successive  approximations, 
starting  with  the  initial  estimate  x^,  one  obtains  x^  =  2  -  x^,  x^  = 

2  -  (2  -  Xq)  =  Xq,  ...,  X2,^_i  =  2  -  Xq,  x^,^  <=  Xq,  etc.,  and  no 
progress  is  made  toward  the  solution. 

Another  way  of  understanding  the  error-reduction  algorithm, 
applicable  for  certain  sets  of  constraints,  is  the  alternating  pro¬ 
jection  of  the  function  onto  specified  subspaces  in  a  Hilbert 

IP 

space.  This,  along  with  the  possibility  of  closed-form  solu- 

1 3 

tions,  is  discussed  in  the  chapter  of  this  book  by  Marks  and 
Smith. 


3.  APPLICATIONS 


A  large  number  of  important  problems  in  optics  and  related  fields 
fit  the  problem  description  in  Section  1  and  can  be  solved  by  the 
iterative  algorithm  (by  the  error-reduction  algorithm  described  in 
Section  2  and  the  related  algorithms  described  in  Section  4).  One 
particular  application,  that  of  spectral  extrapolation  or  superreso¬ 
lution,  is  discussed  in  detail  in  the  chapter  in  this  book  by  Marks 
and  Smith.  In  this  section,  several  classes  of  applications  are 
listed,  followed  by  more  detailed  discussions  of  some  of  the  applica¬ 
tions,  including  examples. 

In  Section  1,  a  distinction  was  made  between  reconstruction 
problems  and  synthesis  problems.  Another  useful  way  to  classify  such 
problems  is  according  to  the  type  of  information  available.  For  one 
set  of  problems,  the  modulus  (magnitude  or  amplitude)  of  a  complex¬ 
valued  function  and  the  modulus  of  its  Fourier  transform  are  measured 
(or  are  given),  and  one  wishes  to  know  the  phase  of  the  Fourier 
transform  pair  in  both  domains.  These  include  the  phase  retrieval 
problem  in  electron  microscopy,  the  phase  retrieval  problem  in  wave- 
front  sensing,  the  design  optimization  of  radar  signals  and  antenna 
arrays  having  desirable  properties,  and  phase  coding  and  spectrum 
shaping  problems  for  computer-generated  holograms  and  other  applica¬ 
tions.  These  applications  often  involve  the  Fresnel  transform  for 
the  near-field  case  instead  of  the  Fourier  transform. 

For  another  set  of  problems,  the  function  is  known  to  be  real 
and  nonnegative  and  the  modulus  of  its  Fourier  transform  is  measured. 
These  include  the  phase  problems  of  X-ray  crystallography,  Fourier 
transform  spectroscopy,  imaging  through  atmospheric  turbulence  using 
interferometer  data,  and  pupil  function  determination. 

For  another  set  of  problems,  a  low-resolution  (i.e.,  a  low-pass 
filtered)  version  of  a  function  is  measured  (i.e.,  its  complex 
Fourier  transform  is  measured  only  over  a  certain  interval),  and  the 
function  is  known  to  have  a  finite  extent  (i.e.,  it  is  zero  outside 


of  some  known  region  of  support).  This  is  the  spectral  extrapolation 
or  superresolution  problem  for  band-limited  time  signals  or  for 
imaging  of  objects  of  finite  extent. 

For  another  set  of  problems,  the  function  is  known  to  be  nonnega¬ 
tive  and  of  finite  extent  and  its  complex  Fourier  transform  is  mea¬ 
sured  only  over  a  partially  filled  aperture.  These  include  the 
interpolation  of  the  complex  visibility  function  for  long  baseline 
radio  intrferometry  and  the  missing-cone  problem  in  X-ray  tomography. 

For  still  another  set  of  problems,  the  modulus  of  a  complex¬ 
valued  function  is  given,  and  one  wishes  to  find  an  associated  phase 
function  which  results  in  a  Fourier  transform  whose  complex  values 
fall  on  a  prescribed  set  of  quantized  complex  values.  These  include 
the  reduction  of  quantization  noise  in  computer-generated  holograms 
and  in  coded  signal  transmission. 

Another  problem  is  to  reconstruct  the  modulus  of  a  complex-valued 
function  from  the  phase  of  the  function,  given  the  fact  that  the 
Fourier  transform  of  the  function  has  finite  support. 

The  number  of  types  of  problems  solvable  by  the  iterative  algo¬ 
rithm  appears  to  be  limited  only  by  one's  ingenuity  in  defining  dif¬ 
ferent  combinations  of  information  that  might  be  available  in  each 
of  two  domains. 

3.1  Modulus — Modulus  Constraints 
Electron  Microscopy 

Among  the  applications  for  which  the  modulus  is  given  in  each  of 

two  domains,  the  electron  microscopy  phase  retrieval  problem  was  one 

of  the  earliest  applications  of  the  error-reduction  algorithm;  the 

electron  microscopy  problem  has  also  been  the  most  heavily 
1  4  8  14  15 

investigated  ’  ’  ’  ’  .  The  initial  discussion  of  the  algorithm 

in  Section  2  was  in  terms  of  the  electron  microscopy  problem  and  so 
it  will  not  be  repeated  here.  The  error-reduction  algorithm  has 


been  shown  to  perform  very  successfully  for  this  problem,  and  the 
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solution  is  usually  unique.  The  reader  is  referred  to  a  book  by 

4 

Saxton  for  a  thorough  review  of  this  problem. 

Spectrum  Shaping 

A  second  application  for  which  the  modulus  is  given  in  each  of 
two  domains  is  the  spectrum  shaping  problems.  Spectrum  shaping  is  a 
synthesis  problem  that  can  be  stated  as  follows:  given  the  modulus 
(magnitude)  ff(x)  |  of  a  coiTip lex-valued  wavefront,  g(x)  «  |f(x)|  exp 
[io(x)],  find  a  phase  function  ©(x)  such  that  l^(3r[g(x)]|  is  equal 
to  a  given  spectrum  |F(u)|.  Such  a  problem  is  the  one  suggested  by 
the  Escher  engraving  shown  in  Figure  3,  in  which  a  bird  transforms 
into  a  fish.  One  wishes  to  find  a  function  with  modulus  being  a 
picture  of  a  fish,  which  has  a  Fourier  transform  with  modulus  being 
a  picture  of  a  bird.  Or,  in  terms  of  computer  holography,  find  a 
phase  function  to  assign  to  the  image  of  a  fish  so  that  the  hologram 
will  look  like  an  image  of  a  bird.  Figure  4(a)  shows  the  actual 
"bird"  and  "fish"  binary  patterns  used  for  our  experiment.  For  the 
first  iteration,  the  fish  object  was  random  phase  coded,  Fourier 
transformed,  and  the  modulus  of  the  Fourier  transform  was  replaced 
with  the  modulus  of  the  bird  pattern  shown  in  Figure  4(a).  The 
result  was  inverse  Fourier  transformed,  yielding  the  very  noisy  out¬ 
put  image  shown  in  Figure  4(b).  The  iterative  algorithm  was  then 
used  for  seven  iterations,  resulting  in  the  improved  image  shown  in 
Figure  4(c).  For  this  example,  increasing  the  number  of  iterations 
resulted  in  a  further  improvement  of  the  quality  of  the  image;  that 
is,  a  Fourier  transform  pair  was  found  that  more  closely  satisfied 
the  constraints  in  both  domains. 

Spectrum  shaping  is  also  important  in  computer  holography  for 
reducing  quantization  noise.  The  objective  of  computer  holography^® 
is  to  synthesize  a  transparency  that  can  modulate  a  wavefront  accord¬ 
ing  to  a  calculated  wavefront,  often  corresponding  to  Fourier  coeffi¬ 
cients  (or  samples  of  the  Fourier  transform  of  an  image)  computed  by 


the  discrete  Fourier  transform.  Let  F  *  0^[f]  be  the  desired  wave- 
front  modulation  and  f  be  the  complex-valued  function  describing  the 
desired  image.  Due  to  the  limitations  of  the  display  devices  and 
materials  used  to  synthesize  computer  holograms,  it  is  often  not 
possible  to  exactly  represent  any  arbitrary  complex  Fourier  coeffi¬ 
cient.  An  extreme  example  of  this  is  the  kinoform^^  which  allows 
nearly  continuous  phase  control  which  is  accomplished  by  varying  the 
thickness  of  the  recording  medium,  but  quantizes  the  modulus  to  a 
single  level.  (If  a  gray-level  display  device,  or  recorder,  used  to 
synthesize  a  kinoform  has  a  finite  number  of  gray  levels,  then  the 
phase  is  quantized  as  well.)  The  desired  coefficient  F  is  only 
approximated  by  the  quantized  value  F/|F|.  Since  only  the  squared 
modulus  (the  intensity)  of  the  image  is  observed,  one  is  free  to  ' 
choose  the  phase  of  the  object  (phase  code  the  object)  in  such  a  way 
as  to  reduce  the  variance  (dynamic  range)  of  |F|,  which  reduces  the 
quantization  noise  in  kinoforms  and,  to  a  lesser  extent,  in  other 

types  of  computer-generated  holograms.  Random  phase  and  various 

18 

deterministic  phase  codes  cause  a  considerable  reduction  in  the 
variance  of  IfI,  but  substantial  errors  remain. 

It  was  for  the  kinoform  application  that  the  iterative  algorithm 
2  3 

was  first  invented.  ’  Figure  5  shows  an  example  of  its  use  for 
this  synthesis  problem.  Figure  5(a)  shows  the  image  resulting  when 
the  input  image  was  random  phase  coded,  encoded  as  a  kinoform  in  the 
Fourier  plane,  and  reconstructed  by  inverse  Fourier  transformation. 
The  ideal  image  would  be  the  binary  («0  or  1)  block  letters  SU. 
Figure  5(b)  shows  the  improved  result  after  eight  iterations  of  the 
iterative  algorithm.^  In  this  case,  the  image-domain  constraint 
is  that  the  modulus  equal  the  SU  pattern,  and  the  Fourier-domain 
constraint  is  that  the  modulus  equal  a  constant. 

A  problem  very  similar  to  the  kinoform  problem  is  that  of  syn¬ 
thesizing  a  quasi-random  radar  signal  having  good  autocorrelation 
properties.  Specifically,  one  would  like  to  synthesize  a  radar 
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signal  f(t)  which  is  a  pure  phase  function,  i.e.,  lf(t)|  «  1,  over 
some  interval  of  time  and  which  has  an  autocorrelation  function  which 
approaches  a  delta-function  (an  impulse),  and  therefore  its  Fourier 
spectrum  1  F(v)l  ^  is  constant  over  the  bandwidth  of  interest.  From 
the  examples  shown  above,  it  is  obvious  that  the  iterative  method 
would  be  an  effective  tool  for  synthesizing  such  radar  signals. 

Another  spectrum-shaping  application  is  the  phasing  of  elements 
of  an  array  of  antennas  in  order  to  achieve  a  far-field  pattern  hav¬ 
ing  desirable  properties.  For  example,  one  might  wish  to  phase  the 
antenna  elements  in  such  a  way  as  to  minimize  the  maximum  sidelobe 
of  the  far-field  pattern,  or  to  place  nulls  of  the  antenna  pattern 
at  several  different  prescribed  locations  simultaneously.  A  related 
application  for  which  the  iterative  method  has  been  used  is  the 

transformation  of  a  Gaussian  laser  beam  into  a  beam  having  a  more 

20 

rectangular  profile. 

Wavefront  Sensing 

The  wavefront  sensing  application  is  very  similar  to  the  electron 
microscopy  problem.  Suppose  that  one  measures  the  image  |f(x)p  of 
a  point  source  using  an  aberrated  optical  system,  where  the  aberra¬ 
tions  may  be  due  to  atmospheric  turbulence  or  due  to  the  optical 
system  itself.  Assuming  that  the  aberration  is  a  pure  phase  func¬ 
tion,  then  F(u),  the  Fourier  transform  of  f(x),  has  modulus  1f(u)| 
equal  to  the  aperture  function  of  the  optical  system.  The  problem 
is  to  reconstruct  the  phase  of  F(u)  given  1F(u)|  and  |f(x)|.  Several 
investigators  ’  ’  have  applied  the  error-reduction  algorithm  to 

this  problem  with  generally  good  results. 

3 . 2  Nonneqati vity — Modulus  Constraints 

For  some  reconstruction  problems,  the  physical  quantity  of  inter¬ 
est  can  be  represented  as  a  nonnegative  function,  and  one  is  able  to 
measure  only  the  modulus  of  its  Fourier  transform  (or  at  least  the 
measured  modulus  information  has  a  much  higher  signal-to-noise  ratio 
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than  the  measured  phase).  From  the  Fourier  modulus,  one  wishes  to 
reconstruct  (or  retrieve)  the  Fourier  phase  or,  equivalently,  the 
function  itself.  Since  the  autocorrelation  of  the  function  is  avail¬ 
able  as  the  inverse  Fourier  transform  of  the  squared  Fourier  modu- 
lus,  this  problem  is  equivalent  to  reconstructing  the  function 
from  its  autocorrelation.  This  problem,  referred  to  as  the  phase 
retrieval  problem  of  optical  coherence  theory,  arises  in  spectros- 
copy,  a  one-dimensional  problem;  in  astronomy,  a  two-dimensional 
problem;  and  in  X-ray  crystal lography,'-*'  a  three-dimensional 
problem.  In  spectroscopy,  the  nonnegative  spectral  density,  g(v), 
is  the  Fourier  transform  of  the  complex  degree  of  temporal  coherence, 
y(t),  of  which  lY('f)  I  is  most  easily  measured.  In  X-ray  crystallog¬ 
raphy,  the  nonnegative  electron  density  function,  p(x,  y,  z),  which 
is  periodic,  is  the  Fourier  transform  of  the  s,tructure  factor 
of  which  I  is  measured  by  a  diffractometer.  The  astronomy 
problem  will  be  described  in  more  detail  later. 

The  Uniqueness  of  Solutions 

For  the  one-dimensional  problem,  use  of  the  iterative  algorithm 

(or  any  other  method)  to  reconstruct  the  function  from  its  Fourier 

modulus  is  of  limited  interest  since  the  solution  in  the  general  case 
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is  usually  not  unique.  ’  The  uniqueness  of  the  solution  for 

the  one-dimensional  problem  can  be  analyzed  using  the  theory  of 

analytic  functions,  from  which  one  finds  that  additional  solutions 

can  be  generated  by  "flipping  zeros"  of  the  Fourier  transform 

analytically  extended  over  the  complex  plane.  ’  The  additional 

"solutions"  have  the  same  support  as  the  original  function,  but  are 

not  guaranteed  to  be  nonnegative;  therefore  one  could  reduce  the 

degree  of  ambiguity  by  generating  all  possible  "solutions"  and  then 

28 

keeping  only  the  nonnegative  ones. 


For  certain  special  types  of  one-dimensional  functions,  there  is 
a  high  probability  that  the  solution  is  unique.  For  a  function  hav¬ 
ing  two  separated  intervals  of  support,  being  separated  by  an 


interval  over  which  the  function  is  zero,  the  solution  usually  is 
29  30 

unique,  ’  but  only  if  the  two  intervals  of  support  are  suffi- 
31 

ciently  separated.  Another  special  type  of  function  for  which 
the  solution  is  usually  unique  is  one  consisting  of  a  summation  of  a 
number  of  delta-functions  randomly  distributed  in  space;  for  such 
functions,  one  does  not  need  the  iterative  method — they  can  be  recon¬ 
structed  by  a  simple  noniterative  method  involving  the  product  of 
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three  translates  of  the  autocorrelation  function. 

In  the  event  that  multiple  solutions  do  exist,  it  would  not 
appear  that  the  algorithm  would  be  biased  toward  one  over  another, 
and  one  would  expect  the  algorithm  to  converge  to  different  solu¬ 
tions,  depending  on  the  initial  input  to  the  algorithm.  For  example. 
Figure  6  shows  two  functions  having  the  same  Fourier  modulus.  In  a 
computer  experiment  using  the  iterative  reconstruction  algorithm  on 
the  functions'  Fourier  modulus,  it  converged  to  one  of  the  solutions 
in  about  half  of  the  trials  and  converged  to  the  other  solution  in 
the  other  half  of  the  trials,  depending  on  the  random  number 
sequences  used  as  the  initial  input  to  the  algorithm. 

For  the  problem  in  two  or  more  dimensions,  it  appears  that  the 

solution  is  usually  unique.  Considering  sampled  functions  defined 
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on  a  rectangular  grid  of  points,  Bruck  and  Sodin  showed  that  the 

existence  of  additional  solutions  is  equivalent  to  the  factorabil ity 

of  a  polynomial  representation  of  the  Fourier  transform.  Since  a 

polynomial  of  one  variable  of  degree  M  can  always  be  factored  into  M 

prime  factors,  there  are  2*^”^  solutions  in  the  one-dimensional 

case.  Once  again,  only  some  of  the  "solutions"  may  be  nonnegative. 

On  the  other  hand,  polynomials  of  two  or  more  variables  having 

arbitrary  coefficients  are  only  rarely  factorable;  consequently,  the 

two-dimensional  problem  is  usually  unique.  Attempts  have  also  been 

made  to  extend  this  concept  to  continuous,  as  opposed  to  discrete, 
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functions.  Although  it  is  always  possible  to  make  up  examples 
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in  two  dimensions  that  are  not  unique,  it  appears  to  be  true  that 
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for  two-dimensional  functions  drawn  from  the  real  world,  the  solution 

is  usually  unique.  The  general  uniqueness  of  the  two-dimensional 

case  is  indicated  by  experimental  reconstruction  results  using  the 

iterative  algorithm.  Furthermore,  noise  in  the  Fourier  modulus 

data  has  had  the  effect  of  adding  noise  to  the  reconstructed  function 

rather  than  causing  the  algorithm  to  converge  to  a  radically  differ- 
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ent  solution. 

Astronomical  Reconstruction 

The  problem  of  reconstructing  a  two-dimensional  nonnegative  func¬ 
tion  from  the  modulus  of  its  Fourier  transform  arises  in  astronomy. 
Due  to  atmospheric  turbulence,  the  resolution  attainable  from  large 
optical  telescopes  on  earth  is  only  about  one  second  of  arc,  many 
times  worse  than  the  diffraction  limit  imposed  by  the  diameter  of 
the  telescope  aperture.  For  a  five-meter  telescope  aperture,  the 
diffraction-limited  resolution  would  be  about  0.02  seconds  of 
arc— fifty  times  finer.  Despite  atmospheric  turbulence,  it  is  pos¬ 
sible  to  measure  the  modulus  of  the  Fourier  transform  of  a  space 
object  out  to  the  diffraction  limit  of  the  telescope  using  interfero¬ 
metric  techniques. The  autocorrelation  of  the  object  can  be 
computed  from  the  Fourier  modulus,  allowing  the  diameter  of  the 
object  to  be  determined.  However,  unless  the  Fourier  transform  phase 
is  also  measured,  it  was  previously  not  possible  to  determine  the 
object  itself,  except  for  some  special  cases.  Previous  attempts  to 
solve  this  problem  had  not  proven  to  be  practical  for  complicated 
two-dimensional  objects. 

The  problem  of  reconstructing  an  object  from  interferometer  data 
can  be  solved  by  the  iterative  method. The  Fourier-domain 
constraint  is  that  the  Fourier  modulus  equal  the  Fourier  modulus 
measured  by  an  interferometer,  and  the  function-domain  constraint  is 
that  the  object  function  be  nonnegative.  Figure  7  shows  an  example. 
Figure  7(a)  shows  a  compute r-s.ynthesi zed  object  used  for  the 
experiment— a  sun-like  disk  having  "solar  flares"  and  bright  and 
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dark  "sunspots."  The  modulus  of  its  Fourier  transform  is  shown  in 
Figure  7(b).  Figure  7(c)  shows  a  square  of  random  numbers  used  as 
the  initial  input  for  the  iterative  algorithm.  Figures  7(d),  7(e) 
and  7(f)  show  the  reconstruction  results  after  20,  230,  and  600 
iterations,  respectively.  Figure  7(g)  shows  the  initial  input  for  a 
second  trial,  and  the  reconstruction  results  after  2  and  215  itera¬ 
tions  are  shown  in  Figure  7(h)  and  7(i),  respectively.  Comparing 
Figures  7(f)  and  7(i)  with  the  original  object  in  Figure  7(a),  one 
sees  that  for  both  trials,  the  reconstructed  images  match  the  orig¬ 
inal  object  very  closely.  Note  that  inverted  solutions  such  as 
Figure  7(f)  are  permitted  for  this  problem  since  the  modulus  of  the 
Fourier  transform  of  f(-x)  equals  the  modulus  of  the  Fourier  trans¬ 
form  of  f(x).  Other  successful  reconstruction  experiments  have  been 

performed  on  data  simulated  to  have  the  types  of  noise  present  in 
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stellar  speckle  interferometry,  and  it  appears  that  under  realis¬ 
tic  levels  of  photon  noise  for  fairly  bright  objects,  diffraction- 

37 

limited  images  can  be  reconstructed.  Initial  experiments  have 

43 

also  been  carried  out  on  data  from  telescopes. 

Pupil  Reconstruction  and  Synthesis 

Another  case  in  which  one  may  want  to  reconstruct  a  two- 
dimensional  nonnegative  function  from  its  Fourier  modulus  is  in  pupil 
function  determination.  In  a  diffraction-limited  optical  system, 
the  point-spread  function  is  the  squared  Fourier  modulus  of  the  sys¬ 
tem's  pupil  function.  Equivalently,  the  optical  transfer  function 

44 

is  the  autocorrelation  of  the  pupil  function.  Given  the  point- 
spread  function  at  a  given  location  in  an  image  plane,  one  could  use 
the  interative  algorithm  to  retrieve  the  corresponding  pupil  func¬ 
tion,  in  a  way  which  is  mathematically  equivalent  to  the  astronomy 
problem.  Turning  this  problem  around,  one  could  use  the  iterative 
algorithm  to  synthesize  (design)  a  pupil  function  which  would  yield 
a  given,  desired  point-spread  function  while  possibly  satisfying 
other  desirable  constraits  as  well. 
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3.3  Finite  Extent — Measurement  Over  Part  of  an  Aperture 

In  a  number  of  reconstruction  problems,  there  is  a  function  of 
known  finite  extent  (or  support)  and  one  wishes  to  reconstruct  the 
function  with  resolution  appropriate  to  an  aperture  in  the  Fourier 
domain  more  complete  than  the  one  over  which  measurements  were 
actually  taken.  In  some  cases,  the  desired  aperture  is  simply  larger 
than  the  aperture  over  which  measurements  were  taken,  and  so  one 
wishes  to  extrapolate  the  function's  Fourier  transform,  or  equiva¬ 
lently,  to  obtain  superresolution  of  the  function.  In  other  cases, 
one  has  made  measurements  over  a  partially  filled  aperture,  in  which 
case  one  wishes  to  interpolate  the  Fourier  transform  of  the  function, 
or,  equivalently,  to  obtain  an  improved  impulse  response  in  the 
function  domain. 

Extrapolation  or  Superresolution 

The  error-reduction  algorithm  was  first  applied  to  the  extrapola- 

45 

tion  (or  superresolution)  problem  by  Gerchberg.  Much  has  been 

written  about  the  iterative  algorithm,  specifically  the  error- 

reduction  algorithm,  as  it  relates  to  this  problem,  including  various 

ways  of  understanding  the  algorithm  (see  the  end  of  Section  2)  and 

proofs  of  convergence. For  this  particular  problem, 

the  nature  of  the  constraints  makes  it  possible  to  implement  the 
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algorithm  by  a  feedback  optical  processor  ’  taking  on  the  order 

Q 

of  10  seconds  per  iteration  even  for  the  two-dimensional  case. 

The  chapter  in  this  book  by  Marks  and  Smith  describes  these  matters 
in  detail . 

Interpolation 

In  tomographic  imaging  systems,  many  projections  of  the  object 
are  measured,  each  projection  yielding  information  about  a  slice 
through  the  Fourier  transform  of  the  object.  When  measurements  over 
only  a  limited  cone  of  angles  are  made,  the  effective  aperture  in 
the  Fourier  domain  has  gaps,  and  the  impulse  response  of  the  system 
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is  highly  irregular.  In  applying  the  iterative  algorithm  to  this 
51  52 

problem  ’  ,  the  function-domain  constraint  is  the  finite  extent 

and  nonnegativity  of  the  object,  and  the  Fourier  domain  constraint 
is  that  the  Fourier  transform  equal  the  measured  Fourier  transform 
over  the  measurement  aperture. 

A  problem  very  similar  to  the  tomography  problem  arises  in  radio 

astronomy.  The  radio  sky  brightness  map  is  a  two-dimensional  real, 

nonnegative  function  which  is  the  Fourier  transform  of  the  complex 

visibility  function.  The  visibility  function  is  measured  by  radio 

interferometry,  and  in  the  case  of  long-baseline  interferometry,  the 

visibility  function  is  measured  only  over  a  limited  set  of  "tracks" 

in  the  visibility  plane,  resulting  in  a  partially  filled  effective 

aperture.  The  error-reduction  algorithm  has  been  used  to  obtain 

improved  maps  by,  in  effect,  interpolating  the  visibility  function 
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to  fill  in  the  area  between  the  tracks.  For  this  problem,  the 
constraints  on  the  brightness  map  are  that  it  be  nonnegative  and  be 
zero  outside  the  known  field  of  view.  In  the  visibility  plane,  the 
constraint  is  that  the  complex  visibility  function  equal  the  measured 
value  within  the  area  of  the  tracks. 

3.4  Modulus — Quantized  Values 

As  mentioned  earlier  in  connection  with  spectrum  shaping,  in  com¬ 
puter  holography  one  may  wish  to  encode  the  Fourier  transform  of  an 
image  as  a  computer-generated  hologram,  but  some  types  of  computer¬ 
generated  holograms  can  encode  only  certain  quantized  complex  values. 

The  kinoform  example  discussed  earlier  is  a  special  type  of  quantize- 

whicH 

tion.  A  more  general  example  is  the  Lohmann  hologram, 

the  modulus  and  phase  of  a  complex  sample  are  determined  by  the  area 

and  relative  position,  respectively,  of  an  aperture  within  a  sampling 
cell.  The  number  of  allowable  quantized  values  is  determined  by  the 
number  of  resolution  elements,  of  the  recording  device  used  to  fabri¬ 
cate  the  hologram,  used  to  form  one  cell.  For  this  synthesis 


problem,  the  function-domain  constraint  is  that  the  modulus  of  the 
function  equal  the  desired  image  modulus  and  the  Fourier-domain  con¬ 
straint  is  that  the  complex  Fourier  coefficients  fall  on  a  prescribed 
set  of  quantized  values.  Experiments  have  shown  that  synthesizing 

such  a  Fourier  transform  pair  is  possible  using  the  iterative  algo- 
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rithm.  For  example,  Figure  8a  shows  a  simulation  of  an  image 
produced  by  a  Lohmann  hologram  having  only  four  modulus  and  four 
phase  quantization  levels  when  the  image  was  random  phase  coded. 
Figure  8b  shows  the  image  after  13  iterations,  a  considerable  im¬ 
provement.  This  problem  is  one  of  a  more  general  class  of  problems 
regarding  the  transmission  of  coded  data. 

3.5  Finite  Extent — Phase 

Finally,  the  iterative  algorithm  has  been  used  to  reconstruct 
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the  modulus  of  a  band-limited  signal  from  its  phase.  ’  Or, 

looking  at  it  in  another  way,  given  that  a  function  has  finite  extent 

and  given  the  phase  of  its  Fourier  transform,  reconstruct  the  modulus 

of  its  Fourier  transform.  For  this  application,  it  has  been  shown 
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that  for  a  wide  class  of  conditions  the  solution  is  unique.  This 
application  will  be  discussed  further  in  Section  4. 


4.  ALGORITHM  CONVERGENCE  AND  ACCELERATED  ALGORITHMS 


As  mentioned  in  Section  2,  the  basic  interative  algorithm  de¬ 
picted  in  Figure  1,  referred  to  as  the  error-reduction  algorithm, 
has  been  shown  to  converge  for  some  applications.  In  this  section, 
the  convergence  is  proven  for  all  appliations.  In  addition,  modified 
algorithms  which  in  practice  often  converge  much  faster  than  the 
error-reduction  algorithm  are  discussed. 

4.1  Convergence  of  the  Error-Reduction  Algorithm 

The  mean-squared  error  can  be  defined  in  general  by  Eqs.‘  (7)  or 
(8).  It  is  a  normalized  version  of  the  integral  over  the  square  of 
the  amount  by  which  the  computed  function  (or  the  computed  Fourier 
transform)  violates  the  constraints  in  the  appropriate  domain.  When 
the  mean-squared  error  is  zero,  then  a  Fourier  transform  pair  has 
been  found  that  satisfies  all  the  constraints  in  both  domains. 

Consider  again  the  steps  in  the  error-reduction  algorithm 
described  in  Section  2.  The  iteration  starts  with  an  estimate 
gi^(x)  for  the  function  which  satisfies  the  function-domain  con¬ 
straints.  Figure  9(a)  depicts  a  phasor  diagram  for  the  complex¬ 
valued  g|^(x)  at  a  particular  coordinate  x.  A  dot,  at  which  the 
phasor  is  pointing  in  Figure  9(a),  was  included  as  an  indication 
that  g|^(x)  is  a  complex  value  that  satisfies  the  function-domain 
constraints.  The  function  estimate  g|((x)  is  Fourier  transformed, 
yielding  G|^(u),  which,  for  a  particular  coordinate  u,  is  depicted 
by  the  phasor  diagram  in  Figure  9(b).  The  next  step  in  the  algorithm 
is  to  change  G|((u)  by  the  smallest  possible  amount  that  allows  it 
to  satisfy  the  Fourier-domain  constraints,  thereby  forming  G|^(u). 
Again  a  dot  is  included  in  Figure  9(b)  at  G^(u)  as  an  indication 
that  it  is  a  complex  value  at  which  the  Fourier-domain  constraints 
are  satisfied.  G^(u)  is  the  inverse  Fourier  transformed,  yielding 
g|^(x)  in  the  function  domain  [see  Figure  9(a)].  Now  consider  the 


unnortnal ized  squared  error,  given  by  the  numerators  in  Eqs.  (7)  and 
(8).  In  the  Fourier  domain,  the  unnormalized  squared  error  at  the 
iteration  is 


®Fk  =  /  du 

(14) 

CD 

'  /  ■  9k 1^  dx 


where  the  second  line  in  this  equation  results  from  ParsevaTs 
theorem.  Referring  to  Figure  9(a),  at  a  particular  coordinate  x, 
g|^(x)  violates  the  function-domain  constraints  by  an  amount  no 
greater  than  |g|^(x)  -  g|^(x)|.  The  complex  values  that  g(x)  can 
have  that  satisfy  the  function-domain  constraints  form  some  set  of 
points  in  phasor  space.  For  example,  if  the  modulus  must  equal 
|f(x)|,  then  the  set  of  such  points  is  a  circle  of  radius  |f(x)|  in 
phasor  space;  if  the  function  must  be  nonnegative,  then  the  set  of 
such  points  is  the  half  line  on  the  positive  real  axis.  The  unnor¬ 
malized  squared  error  in  the  function  domain  at  the  k^^  iteration 
is  given  by 

®0k  “  I  l9k+l^^)  ■  9k(x)  1^  dx  (15) 

where  the  complex  value  g|^+i(x)  for  a  given  coordinate  x  is  the 
point  in  phasor  space  satisfying  the  function-domain  constraints  that 
is  closest  to  g||(x)*  Since  also  g|^(x)  is  a  point  in  phasor  space 
satisfying  the  function-domain  constraints, 

ig|^+l(x)  -  g|^(x)|  <  |g^(x)  -  (16) 

where  equality  holds  only  if  g|^(x)  is  just  as  close  in  phasor  space 
to  g||(x)  as  g|^+i(x)  is.  When  there  is  a  point  in  phasor  space  satis¬ 
fying  the  constraints  that  is  closer  to  g,|(x)  than  g,^(x)  is,  then 
the  left-hand  side  of  the  expression  above  is  strictly  less  than  the 
right-hand  side.  Therefore,  combining  Eqs.  (14)  -  (16), 
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for  the  first  half  a  given  iteration.  From  the  perfect  symmetry  of 
the  error-reduction  algorithm,  as  seen  from  Figure  1,  a  similar 
result  holds  when  one  completes  the  iteration  by  satisfying  the 
function -domain  constraints,  thereby  forming  g|^+i(x),  and  continu¬ 
ing  the  next  iteration  by  Fourier  transforming  gj^+^(x).  One  then 
finds  that 

®F,k+l  -  -  ®Fk 

Therefore,  the  unnormalized  squared  error  can  only  decrease  (or  at 
least  not  increase)  at  each  iteration.  Since  the  normalized  mean- 
squared  error  is  simply  proportional  to  the  unnormalized  squared 
error,  a  similar  result  holds  for  the  errors  defined  by  Eqs.  (7)  and 
(8). 

While  the  error-reduction  algorithm  converges  to  a  solution 
sufficiently  fast  for  some  applications,  it  is  unbearably  slow  for 
others.  In  most  cases,  the  error  is  reduced  rapidly  for  the  first 
few  iterations,  and  then  much  more  slowly  for  later  iterations. 

4.2  Input-Output  Algorithms 

Resulting  from  an  investigation  into  the  problem  of  the  slow 

convergence  of  the  error-reduction  algorithm,  a  new  and  faster- 

converging  algorithm  was  developed,  the  input-output  algo- 
55  7  36  42 

rithm.  ’  ’  ’  The  input-output  algorithm  differs  from  the 

error-reduction  algorithm  only  in  the  function-domain  operation. 

The  first  three  operations — Fourier  tranforming  g(x),  satisfying 
Fourier  domain  constraints,  and  inverse  Fourier  transforming  the 
result — are  the  same  for  both  algorithms.  Those  three  operation,  if 
grouped  together  as  shown  in  Figure  10,  can  be  considered  as  a  non¬ 
linear  system  with  an  input  g(x)  and  an  output  g'{x).  A  property  of 
this  system  is  that  its  output  is  always  a  function  having  a  Fourier 
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transform  that  satisfies  the  Fourier-domain  constraints.  Therefore, 
if  the  output  also  satisfies  the  function-domain  constraints,  then 
all  the  constraints  are  satisfied  and  it  is  a  solution  to  the  prob¬ 
lem.  Then  it  is  necessary  to  determine  how  to  manipulate  the  input 
in  such  a  way  as  to  force  the  output  to  satisfy  the  function  domain 
constraints. 

For  the  error-reduction  algorithm,  the  next  input  g(x)  is  chosen 
to  be  the  current  best  estimate  of  the  function,  satisfying  the 
function-domain  constraints.  However,  for  the  input-output  algo¬ 
rithm,  the  input  is  not  necessarily  an  estimate  of  the  function  or  a 
modification  of  the  output,  nor  does  it  have  to  satisfy  the  con¬ 
straints;  instead,  it  is  viewed  as  the  driving  function  for  the  next 
output.  This  viewpoint  allows  one  a  great  deal  of  flexibility  and 
inventiveness  in  selecting  the  next  input,  and  allows  the  invention 
of  an  algorithm  that  converges  more  rapidly  to  a  solution.  As  will 
be  seen  later,  the  "input-output  algorithm"  actually  comprises  a  few 
different  algorithms,  all  of  which  are  based  on  the  input-output 
point-of-view. 

How  the  input  should  be  changed  in  order  to  drive  the  output  to 
satisfy  the  constraints  depends  on  the  particular  problem  at  hand. 

The  analysis  given  in  the  Appendix  to  this  chapter  for  a  specific 
application  an  be  generalized  as  follows.  Consider  what  happens  when 
an  arbitrary  change  is  made  in  the  input.  Suppose  that  at  the 
iteration,  the  input  g|^(x)  results  in  the  output  g|^(x)*  Further, 
suppose  that  the  input  is  then  changed  by  adding  ag(x): 

g)^+l(x)  -  g,^(x)  +  ag(x)  (19) 

Then  one  would  expect  the  new  output  resulting  from  g|^+-j(x)  to  be 
of  the  form 

gj^+l(x)  ■  9k(^)  »^9(^)  *  additional  noise  (2C) 
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That  is,  the  expected  (or  statistical  mean)  value  of  the  change  of 
the  output,  due  to  the  change  ag(x)  of  the  input,  is  oag(x),  a  con¬ 
stant  times  the  change  of  the  input.  The  system  shown  in  Figure  10 

is  not  linear;  nevertheless,  small  changes  of  the  input  tend  to 
result  in  similar  changes  of  the  output.  The  expected  value  of  the 
change  of  the  output  can  be  predicted,  but  its  actual  value  cannot 
be  predicted  since  it  has  a  non-zero  variance.  In  the  equation 
above,  this  lack  of  predictability  is  indicated  by  the  "additional 
noise"  term.  The  constant  a  depends  on  the  statistics  of  6|^(u) 
and  F(u)  and  on  the  Fourier-domain  constraints. 

If  the  output  g|^(x)  does  not  satisfy  the  function-domain  con¬ 
straints  and  if  g|^(x)  +  Agj(x)  does,  then  one  might  try  to  drive  the 
output  to  satisfy  the  constraints  by  changing  the  input  in  such  a  way 
as  to  cause  the  output  to  change  by  ag^{x).  According  to  the  equa¬ 
tion  above,  the  change  of  the  input  that  will,  on  the  average,  cause 

a  change  Agj(x)  of  the  output  is 

ag(x)  *  a"^Agjj{x)  (21) 

Then  a  logical  choice  for  the  new  input  is 

%+-\M  =  *  6Agjj(x)  (22) 

where  B  is  a  constant  ideally  equal  to  o”\  and  where  agjj(x)  is  a 
function  such  that  g|^(x)  +  Agj(x)  satisfies  the  function-domain  con¬ 
straints.  If  a  is  unknown,  then  a  value  of  b  only  approximately 
equal  to  will  usually  work  nearly  as  well.  The  use  of  too  small 
a  value  of  b  in  Eq.  (22)  will  only  cause  the  algorithm  to  converge 
more  slowly.  The  noise-like  terms  in  Eq.  (20)  are  kept  to  a  minimum 
by  minimizing  Bag^(x)  . 

As  mentioned  earlier,  for  the  input-output  algorithm  g|j(x)  is 
rot  necessarily  an  estimate  of  the  function;  it  is  instead  the  driv¬ 
ing  function  for  the  next  output.  Therefore,  it  does  not  matter 
whether  its  Fourier  transform,  6|^(u),  satisfies  the  Fourier  domain 


constraints.  Consequently,  for  the  input-output  algorithm,  the 

2  2 

mean-squared  error,  Ep,  is  unimportant;  is  the  meaningful 
quality  criterion.  When  computing  E^  for  the  input-output  algorithm, 
the  g|^+^(x)  that  one  should  use  in  the  integrand  of  Eq.  (8)  is  the 
one  determined  by  the  error-reduction  algorithm  rather  than  the  one 
computed  by  the  input-output  algorithm.  That  is,  E^^  should  still 
be  a  measure  of  the  amount  by  which  the  output,  g|^(x),  violates  the 
constraints. 

Another  interesting  property  of  the  system  shown  in  Figure  10  is 
that  if  an  output  g'(x)  is  used  as  an  input,  then  its  output  will  be 
itself.  Since  the  Fourier  transform  of  g'(x)  already  satisfies  the 
Fourier-domain  constraints,  g'(x)  is  unaffected  as  it  goes  through 
the  system.  Therefore,  no  matter  what  input  actually  resulted  in 
the  output  g‘(x),  the  output  g'(x)  can  always  be  considered  to  have 
resulted  from  itself  as  an  input.  From  this  point  of  view,  another 
logical  choice  for  the  new  input  is 

g|^+l{x)  *  g,^(x)  +  eAgj(x)  (23) 

Note  that  if  6  =  1  in  Eq.  (23),  then  this  version  of  the  input- 
output  algorithm  reduces  to  the  error-reduction  algorithm.  Since 
the  optimum  value  of  a  is  usually  not  unity,  the  error-reduction 
algorithm  can  be  looked  on  as  a  sub-optimal  subset  of  one  version  of 
the  more  general  input-output  algorithm.  Depending  on  the  problem 
being  solved,  other  variations  in  Eqs.  (22)  and  (23)  may  be  success¬ 
ful  ways  for  choosing  the  next  input. 

In  order  to  implement  the  input-output  algorithm  using  Eq.  (22) 
or  (23),  one  chooses  Ag^(x)  according  to  the  function-domain  con¬ 
straints.  In  general,  a  logical  choice  is  the  smallest  value  of 
Ag£j(x)  allowing  g|^{x)  +  &g(j(x)  to  satisfy  the  function-domain  con¬ 
straints.  At  those  values  of  x  for  which  gj^(x)  already  satisfies  the 
function-domain  constraints,  one  would  set  Agjj(x)  «  0.  At  those 
value  of  X  for  which  gJ|j(x)  violates  the  function-domain  constraints. 


A 
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examples  of  logical  choices  of  ag^(x)  for  various  applications  are  as 
follows.  For  the  astronomy  problem  and  other  applications  requiring 
the  function  to  be  nonnegative,  choose  ag^{x)  »  “9k(^)  where  g)^'(x)  is 
negative.  For  applications  requiring  the  function  to  be  of  finite 
extent,  choose  ag^{x)  «  ^  outside  the  known  region  of 

support.  For  applications  requiring  the  function  to  have  modulus  equal 
to  lf{x)|,  choose 

9|!{x) 


In  addition  to  the  values  of  Ag^{x)  given  above,  there  are  other 
choices  that  are  successful  when  used  in  Eqs.  (22)  and  (23).  Any 
ag^{x)  that  moves  g'(x)  in  the  general  direction  of  satisfying  the 
function-domain  constraints  will  usually  result  in  an  algorithm  that 
works;  sub-optimum  choices  of  ag^(x)  and  of  0  in  Eq.  (22)  or  (23) 
result  in  algorithms  that  converge  less  rapidly  than  the  optimum. 

Two  examples  of  other  algorithms  that  converge  more  rapidly  than  the 
"logical"  ones  described  in  the  preceding  paragraph  are  as  follows. 
For  applications  requiring  the  function  to  have  modulus  equal  to 
lf{x)l,  it  was  noticed  that  the  difference  in  phase  between  g|^{x) 
and  g|^{x)  tends  to  have  the  same  sign  as  the  change  of  phase  of 
g||,{x)  from  one  iteration  to  the  next.  In  order  to  anticipate  the 
direction  that  the  phase  is  changing,  one  could  choose  a  ag^jlx) 
that  tends  to  rotate  the  phase  angle  of  the  new  input  toward  that  of 
the  last  output.  That  is,  a  good  choice  for  the  desired  change  in 
the  output  is 


in  which  the  first  component  boosts  (or  shrinks)  the  magnitude  of 
the  output  to  match  |f(x)(  and  the  second  component  rotates  the  phase 
angle  of  the  input  toward  the  phase  angle  of  the  output.  For  the 


K 

f 
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astronomy  problem,  it  was  found  that  a  particularly  successful  algo¬ 
rithm  was  to  \ise  Eq.  (23)  at  those  points  where  the  constraints  were 
satisfied  and  use  Eq.  (22)  at  those  points  where  the  constraints  were 
violated,  i.e.. 


gMx),  where  constraints  satisfied 

(26) 

^gi^(x)  -  6g|^(x),  where  constraints  violated 


Furthermore,  it  was  found  that  even  faster  convergence  was  obtained 
by  alternating  between  the  above  equation  and  the  error-reduction 
algorithm  every  few  iterations. 

From  the  paragraphs  above,  it  is  seen  that  the  "input-output 
algorithm"  is  really  a  family  of  algorithms.  The  input-output 
approach  is  one  that  can  lead  to  a  number  of  different  algorithms 
based  on  the  manner  in  which  the  nonlinear  system  of  Figure  10  be¬ 
haves.  One  would  hope  that  the  principles  of  control  theory  and 
possibly  other  disciplines  could  be  used  to  shed  further  light  on 
this  system  and  help  to  arrive  at  algorithms  with  still  more  rapid 
convergence. 

It  should  also  be  noted  that,  unlike  the  error-reduction  algo¬ 
rithm,  the  input-output  algorithm  does  not  treat  the  two  domains  in 
a  symmetric  manner.  By  reversing  the  roles  of  the  two  domains,  one 
can  arrive  at  a  different  and  possibly  more  advantageous  algorithm. 


4.3  Relaxation-Parameter  Algorithm 

A  second  method  of  improved  convergence  is  the  use  of  a  relaxa¬ 
tion  parameter.  In  solving  the  problem  of  reconstructing  the  magni¬ 
tude  of  a  band-limited  function  from  its  phase  (or,  equivalently, 
reconstructing  a  function  of  finite  extent  from  the  phase  of  its 
Fourier  transform),  Oppenheim,  Hayes,  and  Lim^^  modified  the  error- 
reduction  algorithm  (Figure  1)  by  adding  a  relaxation  step,  as  shown 
in  Figure  11.  Here  the  band-limited  function  is  taken  to  be  in  the 
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Fourier  domain.  The  function  g(x)  then  must  be  of  finite  extent 
according  to  the  bandwidth  of  the  Fourier-domain  function.  In  the 
relaxation  step,  gj^(x)  is  formed  from  gj^(x)  according  to 

g^'(x)  -  (1  -  n,^)gj;_i(x)  +  i),^g|^(x)  (27) 

and  then  the  new  estimate  g|^+^(x)  is  formed  from  gJJ!(x)  by  making  it 
satisfy  the  function-domain  constraints.  The  parameter  which  is 
a  constant  which  may  vary  from  one  iteration  to  the  next,  is  the 
relaxation  parameter.  For  «  1,  g||.'(x)  «  g|^(>^)  and  this  reduces  to 
the  error-reduction  approach.  For  n|^  «  0,  gjj!(x)  -  gjj.'_i(x),  that  is, 
the  result  from  the  previous  iteration  is  used.  Other  values  of  n|^ 
give  a  linear  combination  of  g|J!_i(x)  and  gj^(x).  For  the  reconstruc¬ 
tion  of  a  function  of  finite  extent  from  the  phase  of  its  Fourier 
transform  or  from  a  segment  of  its  Fourier  transform  (i.e.,  the 
superresolution  problem),  if  gi‘(x)  and  g^Cx)  both  satisfy  the 
Fourier-domain  constraint,  then  the  linear  combination  ngi'lx)  + 

(1  -  n)g2(x)  also  satisfies  the  constraint  in  the  Fourier  domain, 
follows  from  this  that  g|J.'(x)  given  by  Eq.  (27)  also  satisfies  the 
Fourier-domain  constraint.  In  those  cases,  it  can  be  shown  that  the 
algorithm  converges  for  0  <  t)|^  <  1*  However,  for  other  sets  of 
constraints,  for  example,  given  the  modulus  of  the  Fourier  transform, 
g|J.'(x)  given  by  the  equation  above  does  not  generally  satisfy  the 
Fourier-domain  constraints  and  so  the  relaxation  method  does  not 
strictly  apply. 

The  optimum  value  of  H|^  can  be  determined  as  follows.  Define 
the  function-domain  squared  error  after  the  relaxation  step  as 


It 


/lg;:(x)r 


dx 


(28) 


where  the  region  of  integration,  y,  is  the  region  over  which  the 
function  Is  known  to  be  zero.  Setting  equal  to  zero  the  derivative 
of  e^  with  respect  to  rij^,  and  solving  for  t>,^,  one  finds  the  opti¬ 
mum  value  of  to  be  given  by 
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Use  of  the  relaxation  step  for  the  problem  of  reconstructing  a 
band-limited  function  from  its  phase  resulted  in  an  order  of  magni¬ 
tude  improvement  in  the  speed  of  convergence  of  the  algorithm  over 
that  of  the  error-reduction  algorithm. 

The  relaxation  step  described  above  incorporates  the  optimum 
combination  of  the  current  output  with  the  previous  output.  It  is 
also  possible  to  extend  this  concept  to  include  a  number  of  previous 
outputs, which  may  result  in  still  more  rapid  convergence. 

It  should  be  noted  that  the  majority  of  the  work  referenced  in 
Section  3  made  use  of  only  the  error-reduction  algorithm.  Improved 
speed  of  convergence  could  be  expected  if  one  of  the  two  accelerated 
algorithms  discussed  above  were  employed. 


5.  SUMMARY  AND  COMMENTS 


The  iterative  error-reduction  algorithm,  an  extension  of  the 
Gerchberg-Saxton  algorithm  to  include  various  types  of  constraints, 
has  been  found  to  be  capable  of  solving  a  wide  range  of  difficult 
problems  in  optics  and  other  fields.  It  can  be  applied  to  the  recon¬ 
struction  of  a  function  (an  object,  wavefront,  signal,  etc.)  when 
only  partial  information  is  available  in  each  of  two  domains,  or  to 
the  synthesis  of  a  function  (wavefront,  signal,  etc.)  having  desired 
properties  in  each  of  two  domains.  The  iterative  algorithm  is  rea¬ 
sonably  fast  for  most  applications,  since  the  major  computational 
burden,  two  Fourier  transforms  per  iteration,  can  be  accomplished 
using  the  fast  Fourier  transform  (FFT)  algorithm.  The  iterative 
algorithm  has  been  shown  to  outperform  alternative  methods  of  solving 

these  classes  of  problems  both  because  of  its  speed  and  its  tolerance 
4  9 

of  noise.  ’  For  some  applications,  a  large  number  of  iterations 
is  required  for  convergence  of  the  error-reduction  algorithm.  This 
situation  can  be  remedied  by  using  an  algorithm  with  accelerated 
convergence,  such  as  the  input-output  algorithm  or  an  algorithm  em¬ 
ploying  a  relaxation  step. 

The  iterative  algorithm  has  been  in  use  for  only  a  few  years  and 
yet  already  it  has  already  found  numerous  applications;  and  methods 
of  improving  the  algorithm  have  been  devised.  Nevertheless,  it  is 
safe  to  predict  that  it  will  be  used  in  the  future  to  solve  new  prob¬ 
lems  not  discussed  here,  and  it  is  hoped  that  further  improvements 
of  the  algorithm  will  be  discovered. 
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APPENDIX 


Consider  the  synthesis  problem  for  kinoforms,  for  which  the 
Fourier  modulus  is  set  equal  to  a  constant.  Suppose  that  the  input 
g(x)  to  a  kinoform  system  results  in  the  output  g'(x).  The  kinoform 
has  a  transmittance  G'(u)  -  K  exp  [id(u)]  where  ^(u)  is  the  phase  of 
G(u)  =  |G(u)|  exp  [id(u)]  »  ^[g(x)],  and  K  is  a  constant.  The 
resulting  image  is  g'(x)  «  ‘3*~^CG' (u)].  Now  consider  what  happens 
when  a  change  ag(x)  is  made  in  the  input.  As  illustrated  in  the 
phasor  diagrams  in  Figure  Al,  the  change  ag(x)  of  the  input  causes  a 
change  aG(u)  of  its  Fourier  transform,  which  causes  a  change  aG'(u) 
of  the  kinoform  and  a  corresponding  change  ag'(x)  =  ^“^[aG'(u)] 
of  the  output  image.  The  goal  here  is  to  determine  the  relationship 
between  the  change  ag'(x)  of  the  output  and  the  change  ag(x)  of  the 
input.  Figure  A2  shows  the  relationship  between  aG'(u)  and  two 
orthogonal  components  of  aG(u).  By  similar  triangles,  for  | aG|  « 
iGl, 


aG'(u)  *=  aG^(u)i^^  (Al) 

where  the  two  orthogonal  components  of  aG(u)  are 

aG'‘(u)  -  |aG(u)|  cos  e(u)  (A2) 

parallel  to  G(u),  and 

aG^(u)  -  |aG(u)|  sin  b(u)  (A3) 

orthogonal  to  G(u);  and 

aG(u)  -  aG'‘(u)  +  aG^(u)  -  UG(u)|  (A4) 


where  b(u)  is  the  angle  between  aG(u)  and  G(u).  Only  one  of  the  two 
orthogonal  components  of  aG(u),  namely  aG^(u),  contributes  to 
aG' (u). 

In  order  to  compute  the  expected  change  of  the  output,  E[Ag'(x)], 
treat  the  phase  angles  e(u)  and  the  magnitudes  |G(u)|  as  random  vari¬ 
ables.  Inserting  |aG(u)|  from  Eq.  (A4)  into  Eq.  (A3),  one  gets 


ag‘(u)  .  aG(u)  cos  A(U) 

2 

=  AG{u)[sin  e(u)  +  1  sin  b(u)  cos  6(u)]  (A5) 

1 9 

For  6{u)  uniformly  distributed  over  [0,  2n),  the  expected  value 
of  aG^{u)  is 

E[aG^{u)]  m  aG(u)^-  i*^  «  aG(u)  (A6) 

Therefore,  the  expected  value  of  the  change  of  the  output  is,  using 

Eqs.  (Al)  and  (A6)  and  assuming  that  the  magnitudes  |G(u}|  are 

19 

identically  distributed  random  variables  independent  of  e(u), 
ECAg'(x)3  -  E  [^(aG')]  -  <0F[E(aG')3  ^  gP^E(aG^)  E  (h)] 

-  ^  ^  aG(u)j  E  J  ag(x)E  (A7) 

That  is,  the  expected  change  of  the  output  is  a  times  the  change  of 
the  input,  giving  us  the  second  term  in  Eq.  (20),  where  a  » 

(1/2)E{K/ |G  1).  After  a  few  iterations,  lG{u)l  will  not  differ 
greatly  from  K;  then  a  »  1/2. 

Similarly,  the  variance  of  the  change  of  the  output  can  be  shown 
to  be^® 

E[lag'(x)|2  -  |ECag‘{x)3l2] 

"  i  ■  ■  ['  (l^j  ■  i  (*8) 

where  A  is  the  area  of  the  image.  That  is,  the  variance  of  the 
change  of  the  output  Ag'(x)  at  any  given  x  is  proportional  to  the 
integrated  squared  change  of  the  entire  input.  The  predictability 
of  ag'(x),  and  the  degree  of  control  with  which  one  can  manipulate 
it,  decreases  as  one  makes  larger  changes  of  the  input.  The  differ¬ 
ence  between  the  actual  change  of  the  output  and  the  expected  change 
of  the  output  given  by  Eq.  {A7)  is  what  is  meant  by  the  additional 


noise  term  in  Eq.  (20).  If,  after  a  few  iterations,  l6(u)l  =  K,  then 
in  Eq.  (A8)  the  factor  (l/4){2E(K^/l6l^)  -  [E(K/|G|)]^}  =  1/4. 

Equations  (A7)  and  (A8)  are  a  justification  for  the  input-output 
concept:  small  changes  of  the  input  result  in  similar  changes  of 
the  output,  and  so  the  output  can  be  driven  to  satisfy  the  con¬ 
straints  by  appropriate  changes  of  the  input,  as  in  Eqs.  (22)  and 
(23). 
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FIGURE  CAPTIONS 


1.  Block  diagram  of  the  iterative  error-reduction  algorithm. 

2.  Method  of  successive  approximations  for  solving  4y^  -  4y  +  1 

-  0. 

3.  Bird  transforms  into  fish  (“Sky  and  Water"  by  M.C.  Escher). 

This  reproduction  was  authorized  by  the  M.C.  Escher  Foundation, 
The  Hague,  Holland/G.W.  Breughel. 

4.  Example  of  spectrum  shaping,  (a)  Bird  hologram  and  desired  fish 
image;  (b)  fish  output  image  after  random  phase  coding  of  input; 
(c)  output  image  after  seven  iterations  of  the  iterative 
algorithm. 

5.  Computer-simulated  images  from  kinoform.  (a)  object  random 
phase  coded;  (b)  after  8  iterations  of  the  iterative  algorithm. 

6.  Functions  (a)  and  (b)  having  the  same  Fourier  modulus. 

7.  Reconstruction  of  a  nonnegative  function  from  its  Fourier  modu¬ 
lus.  (a)  Test  object;  (b)  modulus  of  its  Fourier  transform; 

(c)  initial  estimate  of  the  object  (first  test);  (d)-(f)  recon¬ 
struction  results — number  of  iterations;  (d)  20,  (e)  230,  (f) 
600;  (g)  initial  estimate  of  the  object  (second  test);  (h)-(i) 
reconstruction  results — number  of  iterations;  (h)  2,  (i)  215. 

8.  Computer-simulated  images  from  hologram  with  4  magnitude  and  4 
phase  quantized  levels,  (a)  object  random  phased  coded;  (b) 
after  13  iterations  of  the  iterative  method. 

9.  Phasor  diagram  of  functions  for  convergence  proof,  (a)  At  one 
point  in  the  function  domain;  (b)  At  one  point  in  the  Fourier 
domain. 

10.  Block  diagram  of  the  system  for  the  input-output  concept. 

11.  Block  diagram  of  the  error-reduction  algorithm  modified  to  in¬ 
clude  a  relaxation  step. 

A1.  A  change  ag  of  the  input  results  in  a  change  aG'  of  the  kinoform 
and  a  change  of  ag'  of  the  output. 

A2  Relationship  between  aG',  the  change  of  the  kinoform,  and  two 
components  of  aG,  the  Fourier  transform  of  the  change  of  the 
input. 
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Figure!.  Block  diagram  of  the  iterative  error-reduction  algorithm. 
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Figure  4.  Example  of  spectrum  shaping,  (a)  Bird  hologram  and  desired  fish 
image;  (b)  fish  output  image  after  random  phase  coding  of  input; 
(c)  output  image  after  seven  iterations  of  the  iterative 
al  gorithm. 
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Figure  7.  Reconstruction  of  a  nonnegative  function  from  its  Fourier  modu¬ 
lus.  (a)  Test  object;  (b)  modulus  of  its  Fourier  transform; 

(c)  initial  estimate  of  the  object  (firsi  test);  (d)-(f)  recon¬ 
struction  results — number  of  iterations;  (d)  20,  (e)  230,  (f) 
600;  (g)  initial  estimate  of  the  object  (second  test);  (h)-(i) 
reconstruction  results — number  of  iterations;  (h)  2,  (i)  215. 
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Figure  8.  Computer-simulated  images  from  hologram  with  4  magnitude  and  4 
phase  quantized  levels,  (a)  object  random  phased  coded;  (b) 
after  13  iterations  of  the  iterative  method. 
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Figure  10.  Block  diagram  of  the  system  for  the  input-output  concept. 


n.  Block  diagram  of  the  error-reduction  algorithm  modified  to  in¬ 
clude  a  relaxation  step. 


t  results  in  a  change  aG'  of  the  kinoform 
the  output. 


